Skip to main content

David AI

Category: Voice / Speech AI

An audio data research company providing high-quality, proprietary datasets to train and power speech and conversational AI models. David AI was founded in 2024. The company is led by Tomer Cohen. Based in San Francisco, USA. Team size: 1-10. Total funding raised: $80.5M. Latest round: Series B. Key investors include ["Alt Capital","Amplify Partners","BoxGroup","First Round Capital","Y Combinator"].

Founded
2024
Headquarters
San Francisco, USA
Team size
1-10
Total funding
$80.5M

Value proposition

David AI addresses the critical bottleneck in AI development by providing the high-quality, diverse, and research-grade audio data necessary to build more accurate and capable voice and speech recognition models.

Products and solutions

["Proprietary, research-grade audio datasets","Custom dataset creation and curation","Audio data analytics and intelligence"]

Unique value

David AI is unique due to its singular focus on audio data, treating dataset development with the same research and development rigor that AI labs apply to model creation. This specialization allows them to create superior, nuanced audio data (e.g., full-duplex, channel-separated, emotionally rich) that is not readily available.

Target customer

AI developers, researchers, and enterprises, with a focus on leading AI labs and technology companies developing voice-based AI systems.

Industries served

["Artificial Intelligence / Machine Learning","Voice Technology","Conversational AI","Technology"]

Technology advantage

The company's key advantage is solving the urgent and growing demand for high-quality audio training data, a major constraint for the advancement of voice AI. By providing a simple, effective solution to this data scarcity, they have positioned themselves as a critical infrastructure provider for some of the largest technology companies.

How they differentiate

David AI focuses on providing high-quality, research-grade audio datasets specifically for training speech and conversational AI models. They aim to solve the problem of fragmented and limited-scale audio data by offering proprietary datasets that are superior to open-source alternatives like Common Voice.

Main competitors

["Gretel Technologies","Hazy","Wow AI"]

Key partnerships

["While specific names are not publicly disclosed, David AI is reported to have partnerships with several of the 'Magnificent Seven' technology companies.","Y Combinator (as part of their accelerator program)"]

Notable customers

["While specific customer names are not disclosed, David AI is trusted by and provides data to many of the leading audio AI labs and major tech companies."]

Major milestones

["Accepted into Y Combinator's S24 batch.","Raised a $5M Seed Round led by First Round Capital.","Raised a $25M Series A co-led by Alt Capital and Amplify Partners.","Secured an additional $50 million in a new funding round."]

Growth metrics

Specific growth metrics such as revenue or user numbers are not publicly available. However, the company has seen significant investor interest, indicating strong market traction.

Market positioning

David AI is positioned as a premium provider of audio data for top-tier AI labs and large tech companies. They are known for their specialized, high-quality datasets that cater to the advanced needs of AI developers.

Geographic focus

While the company is based in San Francisco, USA, their focus is global, serving the needs of AI development teams worldwide.

Patents and IP

Information not publicly available.

About Tomer Cohen

Previously Head of GTM & Ops for Generative AI and Chief of Staff to the CEO at Scale AI, and a consultant at McKinsey & Company.

Official website: