Cloud Infrastructure Engineer at Alchemy Inc.

Company: Alchemy Inc.

Location: San Francisco

Type: FULL_TIME

Apply for this position

Job Description

<p style="min-height:1.5em"><strong>About the Role</strong></p><p style="min-height:1.5em">As an engineer in the Infrastructure department at Alchemy, you will design, deploy, and continuously improve the infrastructure powering our blockchain developer platform — serving 100+ chains, billions of daily requests, and over $150B in annual transactions.</p><p style="min-height:1.5em">The Infrastructure team provides the infrastructure, tooling, and expertise needed to allow Alchemy engineers to ship, scale, and operate high-quality products in a fast, safe, and cost-efficient manner.</p><p style="min-height:1.5em"><strong>What You'll Do</strong></p><ul style="min-height:1.5em"><li><p style="min-height:1.5em">Architect and operate scalable, self-healing infrastructure leveraging Kubernetes, Terraform, and cloud-native tools across multi-region deployments.</p></li><li><p style="min-height:1.5em">Drive AI enablement across engineering — ensuring repos, tooling, and workflows are optimized for agentic development with tools like Claude Code, Cursor, and Codex.</p></li><li><p style="min-height:1.5em">Build AI-powered infrastructure tooling and automation (e.g., automated K8s upgrades, IaC plan analysis, cost optimization advisors, MCP servers, n8n workflows).</p></li><li><p style="min-height:1.5em">Build and maintain internal developer platform (IDP) capabilities for self-service deployments, observability, and reliability.</p></li><li><p style="min-height:1.5em">Develop observability frameworks using Prometheus and Grafana for metrics, dashboards, and alerting.</p></li><li><p style="min-height:1.5em">Lead incident management with blameless post-mortems; define and enforce SLIs, SLOs, and error budgets across services.</p></li><li><p style="min-height:1.5em">Design and manage multi-cloud, multi-region network architecture — VPC design, IPAM, DNS (Cloudflare), cross-cloud connectivity, security groups, and edge-proxy/istio gateway configuration.</p></li><li><p style="min-height:1.5em">Collaborate with security teams to embed compliance into infrastructure, including IaC scanning and runtime protection.</p></li><li><p style="min-height:1.5em">Provide technical leadership and mentorship to elevate the team's operational capabilities.</p></li></ul><p style="min-height:1.5em"><strong>What We're Looking For</strong></p><ul style="min-height:1.5em"><li><p style="min-height:1.5em">5+ years as an Infrastructure Engineer focused on reliability (SRE, Production Engineer, Platform Engineer).</p></li><li><p style="min-height:1.5em">Experience driving company-wide reliability efforts, including SLO frameworks and error budget policies.</p></li><li><p style="min-height:1.5em">Strong proficiency with observability stacks: OpenTelemetry, Prometheus/Grafana.</p></li><li><p style="min-height:1.5em">Deep experience with cloud infrastructure (AWS/GCP), Kubernetes, and multi-region architectures.</p></li><li><p style="min-height:1.5em">Skilled with Terraform, Helm, and GitOp

Browse More Jobs

Priority job-market routes

Explore exact-match crypto job pages with stronger market coverage, salary context, and fresh protocol hiring inventory.