
Showing top 113 results for "Platform and dashboard"

People also ask

What is the GPU Usage Monitor?

The GPU Usage Monitor is an open-source project that deploys a fully integrated GPU observability stack for Kubernetes. Rather than requiring SRE and platform teams to assemble and configure individual components, the GPU Usage Monitor combines DCGM Exporter, kube-state-metrics, Prometheus, and Grafana into a single deployment, complete with pre-built dashboards designed specifically for GPU-accelerated workloads. The design principle is operational simplicity. A single helm install command results in actionable GPU visibility within minutes, with no custom dashboard authoring or scrape configurat

Get Real-Time Visibility into GPU Usage Across Kubernetes Clusters | NVIDIA Technical Blog

Discussions and forums

r/selfhosted · u/narrow-adventure · 3w ago

MIT-licensed Sentry + Datadog replacement, self-hosts in ~90 seconds

Hi, I've been working on an open-source observability stack that is really easy to self host. About 6 months ago I got super frustrated by paying for Sentry and hosting a bunch of services (otel collector, prometheus, gr…

Hacker News · u/mrcoldbrew · 1w ago

Show HN: InsForge – Open-source Heroku for coding agents

Hi HN, I'm Hang, cofounder of InsForge (YC P26). InsForge is an open-source Heroku for AI coding agents: a backend platform designed for coding agents to deploy, operate, and debug end-to-end. Open source under Apache 2.…

Hacker News · u/Magnanten · 1w ago

Show HN: Superlog (YC P26) – Observability that installs itself and fixes bugs

Hey HN, we’re Nico and Arseniy, co-founders of Superlog (https://superlog.sh). We're building a self-installing, self-healing observability tool meant not to be opened. It has a wizard that daily sets up proper logging a…

r/devops · u/BuffaloJealous2958 · 2w ago

How do you deal with engineers who refuse to touch the actual workflow/process side?

I have a couple really strong engineers on the infra/platform side who are honestly great technically. Fast problem solvers, reliable during incidents, know the systems deeply, people trust them. But they absolutely hate…

r/netsec · u/Huge-Skirt-6990 · 2w ago

WaSteal: 126 Chrome extensions, 148K installs, one Brazilian operator silently sending WhatsApp user data and ad cookies to its servers

126 Chrome extensions, all secretly the same product, taking 148K users' WhatsApp data and ad cookies. A Brazilian company (wascript.com.br) runs one platform that 126 different Chrome extensions all share. They look like…