AI Apparatus: An Open Marketplace for AI Agent Skills and MCP Servers
Introducing ai-apparatus, a fork-friendly marketplace for sharing IDE skills and MCP servers across Cursor, Claude, VS Code, and other AI coding agents.
Technical articles on cloud architecture, DevOps, Kubernetes, and platform engineering.
Introducing ai-apparatus, a fork-friendly marketplace for sharing IDE skills and MCP servers across Cursor, Claude, VS Code, and other AI coding agents.
Lessons from running Qwen, Gemma, and GLM models on local NVIDIA 3080 GPUs with opencode and aider — understanding costs, risks, and how to design guardrails for enterprise use cases.
Team structure, responsibilities, and success metrics for an organizational AI Center of Excellence.
A workflow-based container architecture for running AI agents in production clusters with strict sandboxing, no external tool access, and VPC-bounded outputs.
Common pitfalls when scaling AI experiments into production workflows and how to avoid them.
How to evaluate if an organization is ready for AI adoption across infrastructure, skills, and culture dimensions.
SOC2, HIPAA, and PCI implications when code and data flow through AI coding assistants.
Policies for acceptable use, data handling, model selection, and vendor management in enterprise AI adoption.
RBAC for AI agents, data governance, audit trails, and prompt logging for enterprise AI adoption.
Decision framework for choosing between self-hosted models and API-based services based on volume, latency, privacy, and cost.
Key metrics for AI systems - latency, token usage, quality signals, cost per task, and drift detection.
Inference economics, token-based billing, and cost attribution by team and project for AI workloads.
How Model Context Protocol changes agent architectures and what platform teams need to provide.
Scheduling, quotas, and capacity planning when teams run local inference models on Kubernetes.
How to enable dynamic route selection in Envoy after modifying headers in a Go filter
How to build standardized, secure workflows for AI tool adoption with scoped access, sandboxes, and approval gates.
A robust Azure-based network architecture to secure Kubernetes public endpoints by enforcing VPN traversal, centralizing security, and enhancing auditability.
Learn how to implement secure internet breakout for remote workers using Azure Virtual WAN.
A practical guide to cloud cost optimization in Kubernetes using FinOps principles and Kubecost.
A comprehensive guide to building a scalable, automated MLOps pipeline using AWS SageMaker and Terraform, covering architecture, CI/CD, and infrastructure as code best practices.