Skip to main content

Gimlet Labs

Category: AI Infrastructure

A multi-silicon inference cloud platform designed to optimize and orchestrate complex agentic AI workloads across heterogeneous hardware architectures. Gimlet Labs was founded in 2023. The company is led by Zain Asgar. Based in San Francisco, USA. Team size: 11-50. Total funding raised: $92.0M. Latest round: Series A (Mar 2024). Key investors include Menlo Ventures, Factory, Eclipse, Prosperity7, Triatomic, Lip-Bu Tan, Raghu Raghuram, Bill Coughran, Nick McKeown.

Founded
2023
Headquarters
San Francisco, USA
Team size
11-50
Total funding
$92.0M

Value proposition

Eliminates AI inference bottlenecks and reduces operational costs by dynamically routing workloads to the most efficient hardware (Nvidia, AMD, or specialized AI chips) without requiring manual code refactoring.

Products and solutions

Multi-Silicon Inference Cloud (Core Platform), Agentic Workload Orchestrator, Hardware-Agnostic AI Runtime, Real-time Inference Performance & Cost Dashboard

Unique value

Unlike traditional cloud providers that lock users into specific GPU instances, Gimlet provides a 'liquid' infrastructure that abstracts the underlying silicon, allowing AI agents to hop between different chip architectures mid-workflow for optimal performance.

Target customer

Enterprise AI developers, LLM-based application builders, and companies deploying 'agentic' AI systems that require high-scale, multi-step inference.

Industries served

Artificial Intelligence & Machine Learning, Cloud Infrastructure, Enterprise Software, Financial Services, Healthcare Technology

Technology advantage

Leverages proprietary hardware-software co-design to solve the 'agentic bottleneck'—where multi-step AI tasks are slowed by latency and high costs—by using an intelligent scheduling layer that treats diverse hardware as a single unified pool of compute.

How they differentiate

Proprietary multi-silicon orchestration that abstracts hardware (Nvidia, AMD, NPUs), allowing agentic AI workloads to dynamically route to the most efficient chip mid-workflow without code changes.

Main competitors

Together AI, Anyscale, CoreWeave, Lambda Labs

Key partnerships

Menlo Ventures (Lead Investor), Factory, Eclipse, Prosperity7, Triatomic, Strategic hardware relationships with major GPU and NPU manufacturers

Notable customers

Enterprise AI developers, LLM-based application builders, Agentic AI system providers

Major milestones

Emerged from stealth in March 2024 with $80M Series A, Secured strategic backing from former CEOs of VMware and Cadence, Developed the world's first multi-silicon inference cloud for agentic AI

Growth metrics

Reported eight-figure annual recurring revenue (ARR) upon emerging from stealth in early 2024.

Market positioning

Infrastructure-as-a-Service (IaaS) and orchestration layer for high-scale agentic AI deployments.

Geographic focus

Global, with primary operations in North America (Palo Alto, CA).

Patents and IP

Proprietary orchestration logic and hardware abstraction layers (specific public filings not disclosed, but core IP centers on multi-silicon scheduling).

About Zain Asgar

Co-founder and former CEO of Pixie Labs (acquired by New Relic); GPU architect at NVIDIA; Engineering Lead at Google AI (Google Lens); Director Engineering at Trifacta (employee #6); Adjunct Professor of Computer Science at Stanford University; PhD in Electrical Engineering from Stanford.

Official website: