Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained 2.2.4


Requires NuGet 3.3.0 or higher.

dotnet add package Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained --version 2.2.4                
NuGet\Install-Package Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained -Version 2.2.4                
This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.
<PackageReference Include="Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained" Version="2.2.4" />                
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained --version 2.2.4                
#r "nuget: Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained, 2.2.4"                
#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.
// Install Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained as a Cake Addin
#addin nuget:?package=Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained&version=2.2.4

// Install Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained as a Cake Tool
#tool nuget:?package=Microsoft.ServiceFabricApps.ClusterObserver.Windows.SelfContained&version=2.2.4                

ClusterObserver 2.2.4

This version requires SF Runtime >= 9.0 and targets .NET 6. .NET Core 3.1 is no longer supported.

ClusterObserver (CO) is a stateless singleton Service Fabric .NET 6 service that runs on one node in a cluster. CO observes aggregated cluster health and sends telemetry when a cluster is in Error or Warning. CO shares a very small subset of FabricObserver's (FO) code. It is designed to be completely independent of FO sources, but it lives in the same repo (and SLN) as FO because it is very useful to have both services deployed, especially if you want cluster-level health observation and reporting in addition to the node-level, user-defined resource monitoring, health event creation, and health reporting done by FO. FabricObserver generates Service Fabric health events based on user-defined resource usage Warning and Error thresholds, and ClusterObserver sends those health events to your log analytics and alerting service.

By design, CO will send an Ok health state report when a cluster goes from Warning or Error state to Ok.

CO only sends telemetry when something is wrong or when something that was previously wrong recovers. This limits the amount of data sent to your log analytics service. As with FabricObserver, you can plug in whatever analytics backend you want by implementing the IObserverTelemetryProvider interface. Implementations for Azure ApplicationInsights and Azure LogAnalytics are already provided.
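As a rough illustration of choosing between the two built-in providers, the fragment below sketches the relevant parameters of the ObserverManagerConfiguration section of Settings.xml (the full example configuration appears later in this document). The parameter names come from that example. The value AzureApplicationInsights is an assumption about the accepted provider name, and the empty connection string is a placeholder you would fill in yourself.

```xml
<Section Name="ObserverManagerConfiguration">
	<!-- Assumed value: selects the Application Insights provider instead of AzureLogAnalytics. -->
	<Parameter Name="TelemetryProvider" Value="AzureApplicationInsights" />
	<!-- Placeholder: supply your own Application Insights connection string. -->
	<Parameter Name="AppInsightsConnectionString" Value="" />
</Section>
```

With AzureLogAnalytics selected instead, the LogAnalyticsWorkspaceId, LogAnalyticsSharedKey, and LogAnalyticsLogType parameters would be the ones to fill in.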

The core idea is that you use the aggregated cluster Error/Warning/Ok health state information from ClusterObserver to fire alerts or trigger some other action that gets your attention or engages an SF on-call engineer, for example by automatically creating a support incident. An Ok signal would then automatically mitigate the related incident or ticket.

As of version 2.2.0.831/960, ClusterObserver supports the FabricObserver extensibility model. This means you can extend the behavior of ClusterObserver by writing your own observer plugins just as you can do with FabricObserver.

You can change ClusterObserver configuration parameters by doing a versionless Application Parameter Upgrade. This means you can change settings for CO without having to redeploy the application or any packages.

Application Parameter Upgrade Example:

  • Open an Admin PowerShell console.

  • Connect to your Service Fabric cluster using the Connect-ServiceFabricCluster command.

  • Create a variable that contains all the settings you want to update:

$appParams = @{ "RunInterval" = "00:10:00"; "MaxTimeNodeStatusNotOk" = "04:00:00"; }

Then execute the application upgrade, specifying the application type version that is currently deployed (2.2.1.960 in this example):

Start-ServiceFabricApplicationUpgrade -ApplicationName fabric:/ClusterObserver -ApplicationTypeVersion 2.2.1.960 -ApplicationParameter $appParams -Monitored -FailureAction rollback

Example Configuration:

<?xml version="1.0" encoding="utf-8" ?>
<Settings xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://schemas.microsoft.com/2011/01/fabric">
	<Section Name="ObserverManagerConfiguration">
		<Parameter Name="ObserverLoopSleepTimeSeconds" Value="" MustOverride="true" />
		<Parameter Name="ObserverExecutionTimeout" Value="" MustOverride="true" />
		<Parameter Name="ObserverLogPath" Value="" MustOverride="true" />
		<Parameter Name="EnableVerboseLogging" Value="" MustOverride="true" />
		<Parameter Name="ObserverFailureHealthStateLevel" Value="" MustOverride="true" />
		<Parameter Name="EnableETWProvider" Value="" MustOverride="true" />
		<Parameter Name="ETWProviderName" Value="" MustOverride="true" />
		<Parameter Name="EnableTelemetryProvider" Value="" MustOverride="true" />
		<Parameter Name="EnableOperationalTelemetry" Value="" MustOverride="true" />
		<Parameter Name="AsyncOperationTimeoutSeconds" Value="120" />
		<Parameter Name="TelemetryProvider" Value="AzureLogAnalytics" />
		<Parameter Name="AppInsightsInstrumentationKey" Value="" />
		<Parameter Name="AppInsightsConnectionString" Value="" />
		<Parameter Name="LogAnalyticsWorkspaceId" Value="" />
		<Parameter Name="LogAnalyticsSharedKey" Value="" />
		<Parameter Name="LogAnalyticsLogType" Value="ClusterObserver" />
		<Parameter Name="ObserverShutdownGracePeriodInSeconds" Value="1" />
	</Section>
	<Section Name="ClusterObserverConfiguration">
		<Parameter Name="AsyncOperationTimeoutSeconds" Value="" MustOverride="true" />
		<Parameter Name="Enabled" Value="" MustOverride="true" />
		<Parameter Name="EnableEtw" Value="" MustOverride="true" />
		<Parameter Name="EnableVerboseLogging" Value="" MustOverride="true" />
		<Parameter Name="EnableTelemetry" Value="" MustOverride="true" />
		<Parameter Name="EmitHealthWarningEvaluationDetails" Value="" MustOverride="true" />
		<Parameter Name="MaxTimeNodeStatusNotOk" Value="" MustOverride="true" />
		<Parameter Name="RunInterval" Value="" MustOverride="true" />
		<Parameter Name="MonitorRepairJobs" Value="" MustOverride="true" />
		<Parameter Name="MonitorUpgrades" Value="" MustOverride="true" />
	</Section>
	
</Settings>
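
Parameters marked MustOverride="true" have no usable value in Settings.xml and are expected to be supplied through config overrides in the application manifest, which is also what an Application Parameter Upgrade changes. The fragment below is a minimal sketch of that wiring for the two settings used in the upgrade example, RunInterval and MaxTimeNodeStatusNotOk. The application type name, service manifest name, config package name, and version strings are assumptions for illustration and may differ from the manifest that ships with the package.

```xml
<?xml version="1.0" encoding="utf-8"?>
<!-- Hypothetical type name and version: check the manifest shipped with the package. -->
<ApplicationManifest ApplicationTypeName="ClusterObserverType" ApplicationTypeVersion="2.2.4" xmlns="http://schemas.microsoft.com/2011/01/fabric">
	<Parameters>
		<!-- Application parameters: these are the names passed in $appParams during an upgrade. -->
		<Parameter Name="RunInterval" DefaultValue="00:10:00" />
		<Parameter Name="MaxTimeNodeStatusNotOk" DefaultValue="04:00:00" />
	</Parameters>
	<ServiceManifestImport>
		<!-- Assumed service manifest name and version. -->
		<ServiceManifestRef ServiceManifestName="ClusterObserverPkg" ServiceManifestVersion="2.2.4" />
		<ConfigOverrides>
			<!-- Assumed config package name. -->
			<ConfigOverride Name="Config">
				<Settings>
					<Section Name="ClusterObserverConfiguration">
						<!-- [Name] is replaced with the value of the application parameter of that name. -->
						<Parameter Name="RunInterval" Value="[RunInterval]" />
						<Parameter Name="MaxTimeNodeStatusNotOk" Value="[MaxTimeNodeStatusNotOk]" />
					</Section>
				</Settings>
			</ConfigOverride>
		</ConfigOverrides>
	</ServiceManifestImport>
</ApplicationManifest>
```

Because the Settings.xml values are bound to application parameters this way, changing a parameter value during an upgrade updates the setting without rebuilding or redeploying any package.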

[Figure: Example LogAnalytics Query]

Compatible and additional computed target framework versions (.NET):
Compatible: net6.0
Computed: net6.0-android, net6.0-ios, net6.0-maccatalyst, net6.0-macos, net6.0-tvos, net6.0-windows, net7.0, net7.0-android, net7.0-ios, net7.0-maccatalyst, net7.0-macos, net7.0-tvos, net7.0-windows, net8.0, net8.0-android, net8.0-browser, net8.0-ios, net8.0-maccatalyst, net8.0-macos, net8.0-tvos, net8.0-windows


Version Downloads Last updated
2.3.0 185 8/7/2024
2.2.8 242 1/31/2024
2.2.6 449 10/9/2023
2.2.5 385 8/14/2023
2.2.4 454 4/27/2023
2.2.3 464 3/17/2023
2.2.2 501 3/7/2023
2.2.1.960 579 9/27/2022
2.2.1.831 651 9/27/2022
2.2.0.960 726 7/13/2022
2.2.0.831 784 7/12/2022
2.1.14 761 3/15/2022
2.1.13 710 2/9/2022
2.1.12 5,824 11/23/2021
2.1.11 519 10/11/2021
2.1.10 547 7/13/2021
2.1.9 561 5/6/2021
2.1.8 529 4/27/2021
2.1.7 516 4/16/2021
2.1.6 483 4/7/2021
2.1.5 669 3/10/2021
2.1.4 493 2/25/2021

- Performance and Code improvements.