Gimlet Labs
Category: AI Infrastructure
Gimlet Labs is a multi-silicon inference cloud platform designed to optimize and orchestrate complex agentic AI workloads across heterogeneous hardware architectures. Founded in 2023 and led by Zain Asgar, the company is based in San Francisco, USA. Team size: 11-50. Total funding raised: $92.0M. Latest round: Series A (Mar 2024). Key investors include Menlo Ventures, Factory, Eclipse, Prosperity7, Triatomic, Lip-Bu Tan, Raghu Raghuram, Bill Coughran, and Nick McKeown.
- Founded: 2023
- Headquarters: San Francisco, USA
- Team size: 11-50
- Total funding: $92.0M
Value proposition
Eliminates AI inference bottlenecks and reduces operational costs by dynamically routing workloads to the most efficient hardware (Nvidia, AMD, or specialized AI chips) without requiring manual code refactoring.
Products and solutions
Multi-Silicon Inference Cloud (Core Platform), Agentic Workload Orchestrator, Hardware-Agnostic AI Runtime, Real-time Inference Performance & Cost Dashboard
Unique value
Unlike traditional cloud providers that lock users into specific GPU instances, Gimlet provides a 'liquid' infrastructure that abstracts the underlying silicon, allowing AI agents to hop between different chip architectures mid-workflow for optimal performance.
Target customer
Enterprise AI developers, LLM-based application builders, and companies deploying 'agentic' AI systems that require high-scale, multi-step inference.
Industries served
Artificial Intelligence & Machine Learning, Cloud Infrastructure, Enterprise Software, Financial Services, Healthcare Technology
Technology advantage
Leverages proprietary hardware-software co-design to solve the 'agentic bottleneck'—where multi-step AI tasks are slowed by latency and high costs—by using an intelligent scheduling layer that treats diverse hardware as a single unified pool of compute.
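The idea of a scheduling layer that treats diverse hardware as one pool can be illustrated with a minimal sketch. This is not Gimlet's implementation, which is proprietary and not publicly documented; the device names, throughput figures, prices, and the `route` and `plan` functions below are all hypothetical, chosen only to show how each step of a multi-step workflow might be sent to the cheapest device that still meets its latency budget.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Device:
    name: str                 # hypothetical device label
    tokens_per_second: float  # assumed throughput, illustrative only
    dollars_per_hour: float   # assumed price, illustrative only


@dataclass(frozen=True)
class Step:
    name: str
    tokens: int               # work in this step of the workflow
    max_latency_s: float      # latency budget for this step


def step_latency(step: Step, device: Device) -> float:
    """Seconds the step would take on the device."""
    return step.tokens / device.tokens_per_second


def step_cost(step: Step, device: Device) -> float:
    """Dollar cost of running the step on the device."""
    return step_latency(step, device) * device.dollars_per_hour / 3600.0


def route(step: Step, pool: list[Device]) -> Device:
    """Cheapest device within the latency budget, else the fastest one."""
    feasible = [d for d in pool if step_latency(step, d) <= step.max_latency_s]
    if feasible:
        return min(feasible, key=lambda d: step_cost(step, d))
    return min(pool, key=lambda d: step_latency(step, d))


def plan(workflow: list[Step], pool: list[Device]) -> list[tuple[str, str]]:
    """Route every step independently, so a workflow may span devices."""
    return [(s.name, route(s, pool).name) for s in workflow]


# One unified pool of heterogeneous (hypothetical) devices.
POOL = [
    Device("gpu-a", tokens_per_second=2000.0, dollars_per_hour=4.0),
    Device("gpu-b", tokens_per_second=1000.0, dollars_per_hour=1.5),
    Device("npu-c", tokens_per_second=400.0, dollars_per_hour=0.5),
]

# A three-step agentic workflow with different latency budgets.
WORKFLOW = [
    Step("plan", tokens=400, max_latency_s=0.5),
    Step("retrieve", tokens=200, max_latency_s=2.0),
    Step("summarize", tokens=4000, max_latency_s=1.0),
]

if __name__ == "__main__":
    for step_name, device_name in plan(WORKFLOW, POOL):
        print(f"{step_name} -> {device_name}")
```

With these made-up numbers, the three steps land on three different devices: the latency-tolerant step goes to the cheapest chip, and the step whose budget no device can meet falls back to the fastest one. A production scheduler would also have to account for queueing, data movement between devices, and model placement, none of which this sketch models.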
How they differentiate
Proprietary multi-silicon orchestration that abstracts hardware (Nvidia, AMD, NPUs), allowing agentic AI workloads to dynamically route to the most efficient chip mid-workflow without code changes.
Main competitors
Together AI, Anyscale, CoreWeave, Lambda Labs
Key partnerships
Menlo Ventures (Lead Investor), Factory, Eclipse, Prosperity7, Triatomic, and strategic hardware relationships with major GPU and NPU manufacturers
Notable customers
Enterprise AI developers, LLM-based application builders, Agentic AI system providers
Major milestones
Emerged from stealth in March 2024 with $80M Series A; secured strategic backing from former CEOs of VMware and Cadence; developed the world's first multi-silicon inference cloud for agentic AI
Growth metrics
Reported eight-figure annual recurring revenue (ARR) upon emerging from stealth in early 2024.
Market positioning
Infrastructure-as-a-Service (IaaS) and orchestration layer for high-scale agentic AI deployments.
Geographic focus
Global, with primary operations in North America (Palo Alto, CA).
Patents and IP
Proprietary orchestration logic and hardware abstraction layers (specific public filings not disclosed, but core IP centers on multi-silicon scheduling).
About Zain Asgar
Co-founder and former CEO of Pixie Labs (acquired by New Relic); GPU architect at NVIDIA; Engineering Lead at Google AI (Google Lens); Director of Engineering at Trifacta (employee #6); Adjunct Professor of Computer Science at Stanford University; PhD in Electrical Engineering from Stanford.
Official website: https://gimletlabs.ai/