Overview
Join us for a 60-minute deep dive into batch inference — the most cost-effective way to run large-scale AI workloads. Whether you're processing millions of documents, generating content at scale, or running evaluation pipelines, batch processing can save you 40–60% compared to real-time API calls.
What You'll Learn
- Batch API Architecture: How BatchIn's batch processing engine works under the hood — job queuing, automatic retry, and result delivery.
- Cost Optimization Strategies: Real-world techniques to minimize token spend while maximizing throughput.
- Customer Case Study: How a fintech company processes 2M+ documents daily with 55% cost reduction using BatchIn Batch API.
- Live Demo: Step-by-step walkthrough of submitting a batch job, monitoring progress, and downloading results.
- Roadmap Preview: Upcoming features including scheduled batches, webhook notifications, and priority queuing.
Agenda
| Time (ET) | Session |
|---|---|
| 11:00 AM | Welcome & Introduction |
| 11:05 AM | Batch API Deep Dive |
| 11:25 AM | Customer Case Study: FinTech at Scale |
| 11:40 AM | Live Demo: End-to-End Batch Workflow |
| 11:50 AM | Q&A with Engineering Team |
| 12:00 PM | Wrap-up & Next Steps |
Speakers
- Andy Wang — Co-founder & CEO, BatchIn. Will share the product vision and batch-first strategy.
- Kevin Zhang — Lead Infrastructure Engineer, BatchIn. Will demo the Batch API and share architecture insights.
FAQ
Q: Is this webinar free? A: Yes, completely free. Register to receive the Zoom link.
Q: Will there be a recording? A: Yes. All registrants will receive a recording link within 24 hours after the event.
Q: Can I ask questions during the webinar? A: Absolutely! We'll have a dedicated Q&A session at the end, and you can drop questions in the chat anytime.
Q: Do I need a BatchIn account to follow along? A: Not required, but recommended. Sign up for free and get $50 in credits to try the Batch API yourself.