Cast AI

Category: AI Infrastructure

Cast AI is an AI-powered cloud optimization and GPU orchestration platform that automates Kubernetes management and provides a unified marketplace for high-performance compute. The company was founded in 2019, is led by Yuri Frayman, and is based in Miami, USA, with a team of 200-500. Total funding raised across all rounds: $181M. Latest round: Series C ($108M, April 2025, led by G2VP and SoftBank Vision Fund 2) for Kubernetes cloud cost optimization. Key investors include Pacific Alliance Ventures, SoftBank Vision Fund 2, Vintage Investment Partners, Creandum, Samsung Next, Cota Capital, G2 Venture Partners, Aglaé Ventures, Hedosophia, Uncorrelated Ventures, and Metanet.

Founded: 2019
Headquarters: Miami, USA
Team size: 200-500
Total funding: $181M (all rounds)

Value proposition

Automates cloud infrastructure management to reduce costs by an average of 63% while solving GPU scarcity through a unified, cross-provider compute marketplace.

Products and solutions

Kubernetes Cost Optimization (Automated Rightsizing); OMNI Compute (Unified GPU Marketplace); AI-driven Autoscaling & Spot Instance Management; Cloud Security & Compliance Monitoring; Multi-cloud Infrastructure Orchestration

Unique value

Moves beyond 'read-only' cost reporting to 'active automation,' in which AI makes real-time infrastructure changes; features the industry's first unified marketplace for GPU compute.

Target customer

DevOps teams, AI/ML engineers, and enterprise organizations running large-scale Kubernetes clusters on AWS, Azure, or Google Cloud.

Industries served

Artificial Intelligence & Machine Learning, Cloud Computing & SaaS, Fintech, E-commerce, Enterprise Software

Technology advantage

Proprietary AI algorithms for predictive scaling and automated spot instance fallback, combined with OMNI Compute technology that abstracts diverse GPU hardware into a single accessible pool.

How they differentiate

Differentiates through 'active automation' rather than visibility alone: the platform autonomously performs real-time rightsizing and spot instance management. Its unified GPU marketplace, OMNI Compute, is aimed at the hardware scarcity that AI workloads face.

Main competitors

Spot by NetApp, Kubecost, Harness, Zesty

Key partnerships

AWS, Microsoft Azure, Google Cloud, Oracle Cloud Infrastructure (OCI), Pacific Alliance Ventures, SoftBank Vision Fund 2, Metanet, HuggingFace, Samsung Next, Vintage Investment Partners

Notable customers

Akamai, BMW, Cisco, FICO, HuggingFace, NielsenIQ, Swisscom, Samsung, TGS, Branch, Iterable

Major milestones

Achieved unicorn status ($1B valuation) in January 2026; launched OMNI Compute, the industry's first unified GPU marketplace; surpassed 2,100 enterprise customers globally; named a Cool Vendor by Gartner in Cloud Computing; opened new offices in Bangalore, London, New York, Tel Aviv, and Dallas; secured strategic investment from Pacific Alliance Ventures (Jan 2026); secured investment from Metanet and a credit facility from J.P. Morgan (Oct 2025).

Growth metrics

Reached a $1 billion valuation in January 2026; serves over 2,100 enterprise customers globally; has offices in 7+ countries, including the US, Lithuania, the UK, India, and Israel.

Market positioning

Positioned as a leader in Kubernetes cost optimization and AI infrastructure, having reached unicorn status ($1 billion valuation) in January 2026.

Geographic focus

Global, with a primary focus on North America and Europe (HQ in Miami, Florida, with significant R&D operations in Lithuania).

Patents and IP

Proprietary AI-driven automation and orchestration logic; no specific public patent registrations are disclosed in the latest funding updates.

About Yuri Frayman

Yuri Frayman is a serial entrepreneur with over 30 years of experience and a track record of five successful exits totaling over $750 million. His notable exits include Zenedge (acquired by Oracle), Viewdle (acquired by Google), The Frayman Group (acquired by Intapp), and LegalKEY (acquired by Hummingbird). He also co-founded CUJO AI and has extensive expertise in cloud computing, cybersecurity, and AI-driven automation.

Official website: