Motadata AIOps Health Monitoring
The Health Screen Monitoring module in Motadata AIOps serves as a comprehensive hub for monitoring various aspects of your Motadata AIOps deployment. Acting as a centralized point of access, it allows users to monitor live sessions, database and cache statistics, upgrade details, and restore past backup versions of Motadata AIOps. This feature streamlines maintenance tasks and provides a holistic view of the health and performance of Motadata AIOps, regardless of deployment type.
Tabs Overview
Tab | Description |
---|---|
Health Overview | Provides an overall snapshot of the Motadata AIOps deployment's health. |
Database | Offers insights into the database performance and statistics. |
Live Session | Monitors active user sessions in real-time. |
Alert | Tracks alerts generated on the AIOps artefacts such as the application server, database, and collector. |
Restore | Facilitates the restoration of past backup versions of Motadata AIOps. |
Upgrade | Displays details and enables upgrades of Motadata AIOps. |
- Health Overview
- Application
- Database
- High Availability
- Live Session
- Alert
- Restore
- Upgrade
Health Overview
In the Health Overview section, users can access various widgets to assess the overall health of their Motadata AIOps deployment:
Widget | Description |
---|---|
Deployment Status | Provides an overview of the health status of all installed artifacts, including the Master Application Server, database, and collectors. Users can quickly identify the status of each artifact within the deployment. |
Engine Statistics | Offers detailed statistics on various engines within the Motadata AIOps deployment, such as Metric Poll, Notification, Config Create, Rediscover, and Event Policy. Users can monitor metrics such as dropped events, pending events, queued events, finished events, and idle workers. |
Monitors (Polling Issue) | Highlights any polling issues encountered by monitors within Motadata AIOps. It provides details on monitors facing polling failures, including group, tags, monitor type, and the reason for the polling failure. This widget enables users to promptly identify and address polling issues to ensure uninterrupted monitoring capabilities. |
Application
The Application section within the Health Monitoring screen provides vital insights into the JVM (Java Virtual Machine) performance. This section offers detailed information on various aspects of the JVM, allowing users to monitor and assess the health of the application runtime environment.
From the top right of the screen, users can select the entity (e.g., primary application server, secondary application server) from the dropdown menu to view statistics specific to that application server. Once an entity is selected, the statistics on the page will reflect the performance metrics of the chosen application server.
Besides this dropdown, users have the option to download the diagnostics data from this page for further analysis.
Field | Description |
---|---|
Thread Count | Provide the details about the current number of active threads within the JVM. |
Daemon Threads | Shows the count of daemon threads that are running in the JVM. |
Heap Memory | Provides the amount of used and committed heap memory currently utilized by the JVM. |
Non-Heap Memory | Provides the amount of used and committed non-heap memory currently utilized by the JVM. |
Init Memory | Indicates the initial memory allocation for the JVM at startup. |
Max Heap Memory | Displays the maximum amount of heap memory allocated to the JVM. |
Garbage Collection Summary | Presents a grid view of the Garbage Collection (GC) summary, including information on the count and duration of GC events. |
Non-Heap Memory Trend | Offers a graphical representation of the non-heap memory usage trend for the current day's timeline. |
Heap Memory Trend | Shows the trend of heap memory usage, including today’s data and a comparison of the past 7 days. |
By monitoring these metrics, users can maintain optimal JVM performance and ensure that the application server is running efficiently.
Database
The Database section within the Health Screen Monitoring module of Motadata AIOps offers comprehensive insights into the health and performance of various databases associated with your AIOps deployment. Users can select the database of interest from the dropdown menu, including primary, secondary, or replica databases, depending on their AIOps deployment configuration.
Users can select the entity (e.g., primary database, secondary database, replica database) from the dropdown menu to view statistics specific to the selected database. The metrics on the page will update based on the database chosen.
Besides this dropdown, users have the option to download the diagnostics data for the selected database directly from this page.
Widget | Description |
---|---|
Cache Details Widget | Presents essential metrics related to the cache performance of the selected database, including cache entry count, cache hit ratio, cache touches, and cache average access time. These metrics provide valuable insights into the efficiency and effectiveness of the database cache. |
Graphical Widget | Displays graphical representations of key database performance metrics, enabling users to visualize trends and patterns over time. Metrics such as top queries by execution time, pending queries, total queries, pending files to sync, and query latency are graphically represented for enhanced analysis. |
By leveraging these widgets, users can gain a comprehensive understanding of the database health and performance of the selected database within their Motadata AIOps deployment.
High Availability
The High Availability section within the Health Monitoring screen provides detailed insights into the Observer in your deployment scenario. This tab is crucial for monitoring the health and performance of entities connected to the observer, ensuring seamless connection within your infrastructure.
Field | Description |
---|---|
Observer Connection Summary | Displays a grid view of the connection details for all entities connected to an observer, including the IP address of that server, observer IP connected to it, type of the server, connection duration, and connection status. This summary helps monitor the current state of the observer's connections and identify potential issues. |
Sync Statistics Summary | Presents a grid view of the sync statistics for every entity connected to an observer. Key metrics include pending events, synced events, total events, and the specific engine for which the observer will perform sync. |
This section allows users to monitor the overall synchronization and connection health of their setup, ensuring that all entities remain in sync and perform optimally.
Live Session
The Live Session tab within the Health Screen Monitoring module of Motadata AIOps provides real-time insights into the active user sessions currently running on the platform. Users can conveniently monitor live sessions, accessing details such as the remote IP address, browser information, operating system, and user type for each active session.
Field | Description |
---|---|
Type | Identifies the type of user logged into each session. |
Remote IP | Displays the IP address from which the session is initiated. |
OS | Indicates the operating system running on the user's device. |
Browser | Provides details about the web browser used to access the session. |
Duration | Indicates the time for which the session is running. |
Alert
In the Alert section of the health monitoring screen, users can access detailed information about alerts raised on servers used for AIOps deployment, including application servers, databases, and collectors. Here, users can find:
Field | Description |
---|---|
Policy Name | The name of the policy that triggered the alert. |
Metric | The specific metric for which the policy is triggered. |
Value | The value of the metric at which the alert is triggered. |
Instance | If applicable, the specific instance for which the alert is triggered. |
This section provides users with comprehensive insights into the alerts generated within their AIOps deployment, allowing for prompt action and efficient monitoring of system health.
Restore
Refer Restoring Backups to learn about the Restore functionality.