PulseGrid is a lightweight production monitoring system built in C++ that tracks HTTP endpoint uptime and latency, streaming updates via WebSocket. Designed with a clear domain-to-presentation architecture, it runs on a low-cost VPS and uses the Vix.cpp framework. The project's source code is available on GitHub.
#monitoring
16 items
Grafana 13
2.0Grafana 13 introduces new features including enhanced alerting capabilities, improved dashboard performance, and expanded data source integrations. The release focuses on better user experience with streamlined workflows and visualization options.
pg_roast is a Postgres extension that provides critical feedback on database design and performance. The tool analyzes database configurations and offers blunt assessments to highlight potential issues.
The article discusses replacing Uptime Kuma with Gatus on a low-cost VPS costing $1.20 per year. It explains the technical setup and configuration process for this monitoring solution on limited resources.
OpenData Timeseries is a new service that provides Prometheus-compatible metrics on object storage. The platform allows users to store and query time series data using the same API as Prometheus while leveraging scalable object storage infrastructure.
AppSignal provides monitoring capabilities for CPU and memory usage on virtual private servers. The tool helps track system performance metrics and resource consumption. Users can set up alerts and visualize data through dashboards.
The article discusses observability for AI agents, covering monitoring, logging, and tracing to understand agent behavior and performance. It explains how observability tools help developers debug, optimize, and ensure reliability of AI agent systems in production environments.
Simple Observability has developed a metric simulator tool that generates realistic time-series data for testing monitoring systems. The simulator allows users to create custom metrics with various patterns and anomalies to validate alerting and visualization setups.
Signoz, an open-source observability platform, uses its own tool to monitor its infrastructure. The company's engineering team shares insights about their observability setup, including metrics, logs, and traces for their distributed systems.
This GitHub repository contains a Prometheus exporter for RDMA (RoCE) NIC statistics on Linux systems. The tool collects and exposes RDMA network interface metrics for monitoring through Prometheus.
Evlog is a wide events logging platform that provides comprehensive event tracking and monitoring capabilities. The service enables organizations to collect, analyze, and manage event data from various sources.
Vale Observability Metrics provides monitoring and analytics capabilities for tracking system performance and health. The platform offers real-time insights into application behavior and infrastructure metrics.
The Pi Agent Dashboard is a tool for monitoring Pi and OMP sessions. It provides visibility into session activities and performance metrics for users managing these systems.
The article discusses retroactive sampling as a method for optimizing tail sampling in OpenTelemetry. It explains how this approach can improve performance and reduce costs in distributed tracing systems.
A command-line script called 'promdownhosts' proved more useful than web dashboards during a power outage recovery. The script prints a text table of down machines, allowing easy filtering and access from server consoles without browsers.
Several EU member states, led by Denmark, are pushing to require WhatsApp, Signal and similar services to scan all user photos and links with AI for potential child sexual abuse material. If AI flags content as suspicious, users' photos, location, phone numbers and other data would be reported to Europol and local police.