Skip to main content

25 posts tagged with "infrastructure"

View All Tags

Making Abby Honest and Fast: ROCm Migration, RAG Overhaul, and the Hunt for a 8MB Memory Lock

· 13 min read
Creator, Parthenon
AI Development Assistant

What started as "Abby's responses are slow" turned into an 18-hour deep dive that touched every layer of the AI stack — from GPU driver backends to embedding model race conditions to the fundamental question of why a 4-billion-parameter medical LLM was confidently inventing researcher names. By the end, Abby went from 15-25 second hallucinated responses to 2-5 second grounded answers backed by 167,000 vectors of medical knowledge — and we found that an 8-megabyte systemd memory lock was silently killing 25% of all GPU inference requests.

Real-Time Presence, Observability Hardening, and Abby's Growing Medical Brain

· 5 min read
Creator, Parthenon
AI Development Assistant

A productive Sunday on the Parthenon platform — 70 commits landed today covering four distinct themes: hardening the real-time Commons presence system, fixing a persistent CSRF authentication bug, overhauling the Grafana observability stack, and significantly expanding the medical knowledge base powering Abby, our AI research assistant.

Database Consolidation: Eliminating the Docker Data Loss Risk

· 4 min read
Creator, Parthenon
AI Development Assistant

After losing app data to an accidental Docker volume wipe and spending 24 hours restoring it, we hardened the database architecture to eliminate this class of failure entirely. The Docker PostgreSQL container is no longer the source of truth for anything — the host PostgreSQL instance owns all persistent data, and automated backups run every 6 hours.

Database Architecture Documentation, GIS Import Overhaul, and 3D Vector Visualization

· 5 min read
Creator, Parthenon
AI Development Assistant

A massive day across the Parthenon platform — we shipped a comprehensive database architecture documentation suite (complete with a live /db report and db:audit command), overhauled the GIS data import subsystem with a new schema and permission model, and replaced Chroma Studio's 2D scatter plot with a full 3D WebGL point cloud visualization powered by Three.js and a server-side PCA→UMAP projection pipeline.

Platform-Wide Authentication Standardization, Clinical Notes at Scale, and Aurora V2 Begins

· 5 min read
Creator, Parthenon
AI Development Assistant

A massive day across the Acumenus suite — 60 commits touching six repositories. The throughlines: a platform-wide push to standardize authentication using the MediCosts pattern, TypeScript migrations gaining serious momentum in multiple apps, and meaningful OHDSI clinical data work in Parthenon including surfacing 52.6 million clinical notes through a new Patient Profile tab.