Reducto just raised $24.5M in Series A funding to help enterprises unlock unstructured data with near-perfect accuracy.
AI teams today are bottlenecked by messy, real-world documents—so Reducto built the most accurate parsing pipeline in the industry. By combining vision-language models with agentic workflows, Reducto turns complex PDFs and scanned documents into structured, LLM-ready data.
Now trusted by companies like Scale AI, Vanta, and top AI teams, Reducto has parsed over 250 million pages and is expanding into full end-to-end pipelines: document splitting, classification, structured extraction, and more. With their new Agentic OCR framework, they’re pushing toward human-level accuracy, automating in seconds what used to take teams days.
YC Partner Diana Hu recently sat down with the Reducto founders to talk about how they got here, their founding story, and the kind of company they are building.
Learn more about Reducto at https://reducto.ai.
Apply to Y Combinator: https://ycombinator.com/apply
Chapters (Powered by ChapterMe):
00:00 - Data-driven AI for large enterprises
01:17 - Document management
03:04 - Simplify PDF processing for companies
03:59 - Aha moment for PDF extraction, interesting approach
05:02 - NLP-based PDF extraction for enterprise apps
06:56 - Great data, exciting use cases
08:10 - Best places for customer approaches
08:48 - Closing a Fortune 25 deal in just two months
11:21 - Data-driven AI for high-quality documents
13:19 - Reducto's AI-focused infrastructure attracts top companies
15:18 - Quality of data, results, support