4-Week Live Bootcamp

Build an Enterprise SLM.
From scratch, end to end.

A 4-week live bootcamp taught by Dr. Sreedath Panat (MIT PhD). Take a small language model end to end, from raw data through tokenization, pretraining, fine-tuning, alignment, inference engineering, and deployment.

Starts June 25, 2026Thursdays & Saturdays · 9-11 AM IST

Can't attend live? All sessions are recorded for lifetime access.

“If you want to understand LLMs beyond API calls, the best way is to build one from the ground up.”

Enroll Now ›View Curriculum

Live Lectures

Weeks

Per Lecture

Capstone Project

Scroll

Why Enterprise SLMs

Small models are the next
enterprise standard.

67% of enterprises prefer on-premise AI. Open-source SLMs deliver 10x cost savings over cloud APIs. The shift from massive models to efficient, domain-specific ones is accelerating.

$68B

Enterprise AI market by 2030

Grand View Research›

$4.6B

VC investment in open-source AI (2024)

Crunchbase›

67%

Enterprises prefer on-premise AI deployment

McKinsey›

10x

Cost reduction with SLMs vs cloud LLM APIs

a16z›

Enterprise SLM Market Size

Open-Source & On-Premise AI - Global

44% CAGR

$1.2B

2022

$2.8B

2023

$5.5B

2024

$9B

2025

$14B

2026

$28B

2028

$45B

2030

Source: Grand View Research · Markets and Markets

Open-Source AI Unicorns & Funding

The open-weight SLM ecosystem is exploding

Mistral AI

$6.2B valuation

Databricks

$62B valuation

Together AI

$3.3B valuation

Cerebras

$4B valuation

Anyscale

$1B valuation

Lightning AI

$800M valuation

Sources: TechCrunch · Crunchbase

“I think AGI may come in under 1 billion parameters.”

Karpathy has argued that true general intelligence could emerge from small, efficiently trained models - not trillion-parameter behemoths. This is exactly the thesis behind enterprise SLMs.

— Andrej Karpathy, YouTube

Industry Leaders

The people building AI
are betting on small.

From Eureka Labs to Meta to HuggingFace - the consensus is clear. Smaller, domain-specific models are the future of enterprise AI.

“LLM model size competition is intensifying... backwards! My bet is that we'll see models that 'think' very well and reliably that are very very small.”

Andrej Karpathy

Founder, Eureka Labs

“My prediction: in 2024, most companies will realize that smaller, cheaper, more specialized models make more sense for 99% of AI use-cases.”

Clement Delangue

Co-founder and CEO, HuggingFace

“My main research goal is to make AI accessible so that everyone can tinker with AI, learn from it, and integrate it and use it in their own work.”

Tim Dettmers

Creator, bitsandbytes and QLoRA

“We have a handful of frontier labs that determine this set of AI models that are served the same way to everyone, and they're very expensive to adapt. I think that doesn't need to be true anymore.”

Sara Hooker

Founder, Adaption Labs

“Open source AI foundation models will wipe out closed and proprietary AI models, for the same reason Wikipedia wiped out generalist commercial encyclopedia.”

Yann LeCun

Chief AI Scientist, Meta

“If you want to understand LLMs beyond API calls, the best way is to build one from the ground up.”

Sebastian Raschka

Author, Build a Large Language Model (From Scratch)

Who is this for?

Built for engineers who want to go deep.

Engineers transitioning into LLM engineering, NLP, or enterprise AI
Developers building AI-powered products - chatbots, search, document Q&A
Engineers who want to go beyond API calls - to building and training models
Researchers who need production engineering depth alongside theory

What you will achieve

Leave production-ready.

Enterprise AI interview question:

“Design an enterprise SLM system that fine-tunes on private data, supports RAG with permission-aware retrieval, and deploys on-premise with sub-100ms latency. Walk me through the architecture.”

Asked at companies building enterprise AI products. You will have a complete answer.

Build, train, and deploy enterprise SLMs from scratch - transformers, SFT, RLHF, quantization
Design efficient training pipelines with LoRA, QLoRA, and distributed training
Implement RAG, tool calling, and agentic workflows with small models
Deploy models with vLLM, llama.cpp, and production inference servers
Build industry-level portfolio projects from hands-on capstone work

02What You'll Learn

The complete toolkit.
One SLM bootcamp.

Go from zero to building, training, and deploying enterprise-grade small language models in Python.

Transformer internals

Understand every layer of the transformer: attention, feedforward, RMSNorm, and rotary positional encoding from first principles.

Data curation and tokenization

Source, clean, and deduplicate a corpus. Train tokenizers from scratch and format data for pretraining, SFT, and preference tuning.

Pretraining from scratch

Train a language model from a blank slate. Implement the full training loop, apply scaling-law economics, and read the loss curve.

Fine-tuning with LoRA and QLoRA

Run supervised fine-tuning and parameter-efficient adaptation. Tune ranks, quantize to 4-bit, and decide fine-tune vs RAG vs prompt.

Preference tuning and guardrails

Align models with reward modeling, DPO, and RLHF. Add moderation, refusals, and prompt-injection defenses on top.

Evaluation and retrieval (RAG)

Build evaluation suites with LLM-as-judge and golden sets. Stand up retrieval with hybrid search, reranking, and grounded citations.

Inference engineering

Reason about prefill versus decode, KV-cache memory, and quantization. Serve with paged attention and continuous batching, then benchmark.

Deployment and operations

Ship endpoints and streaming APIs. Add observability, drift detection, canary rollouts, and a retrain lifecycle for production.

How SLMs Are Built

The full training pipeline.
End to end.

From raw data to deployed model: every stage of building an enterprise-grade small language model.

Raw Data

Web crawl, books, code

Clean & Format

Dedup, filter, chunk

training

Tokenizer

BPE / SentencePiece

Embeddings

Learned vectors

Multi-Head Attention

Add & Norm

Feed-Forward

Add & Norm

Transformer Block

Next-token prediction

Pre-training

Cross-entropy loss

👍👎

SFT + RLHF

Alignment & safety

Hello, I am a SLM

Inference

KV Cache, vLLM, Quantize

Raw Data

Web crawl, books, code

Clean & Format

Dedup, filter, chunk

training

Tokenizer

BPE / SentencePiece

Embeddings

Learned vectors

Multi-Head Attention

Add & Norm

Feed-Forward

Add & Norm

Transformer Block

Next-token prediction

Pre-training

Cross-entropy loss

👍👎

SFT + RLHF

Alignment & safety

Hello, I am a SLM

Inference

KV Cache, vLLM, Quantize

01Curriculum

8 days of building.
One complete education.

Each day builds on the previous. By the end, you'll have built a complete enterprise SLM from scratch.

The cost, latency, and privacy case for small models
Attention, feedforward, RMSNorm, and RoPE from first principles
MQA and GQA, parameter counting, and decoding strategies
Scaling laws and the case for compact 100M to a few-billion models
GPU memory math: how model size drives deployment cost
Build a minimal transformer forward pass and read its next-token distribution

04Tools & Models

The tools that power
production SLMs.

You won't just learn theory — you'll build with the same tools used to train and deploy real enterprise language models.

PyTorchCore Framework

HuggingFaceTransformers & Datasets

DeepSpeedDistributed Training

bitsandbytesQuantization

LoRA / PEFTEfficient Fine-Tuning

vLLMInference Serving

Weights & BiasesExperiment Tracking

llama.cppLocal Deployment

PythonCore Language

PyTorchCore Framework

HuggingFaceTransformers & Datasets

DeepSpeedDistributed Training

bitsandbytesQuantization

LoRA / PEFTEfficient Fine-Tuning

vLLMInference Serving

Weights & BiasesExperiment Tracking

llama.cppLocal Deployment

PythonCore Language

Capstone Project

Build something
you can actually ship.

The bootcamp ends in one capstone: a small language model for legal contracts, built end to end and deployable on-prem.

A confidential legal contract assistant

Take a small language model from raw contracts to a running, on-prem endpoint. It reads an agreement, extracts and classifies the clauses, flags privilege and personal data for e-discovery, and answers questions grounded in the document with a citation to the exact clause.

Clause extraction and classification, trained on annotated contracts (CUAD)
Privilege and PII detection for e-discovery
Retrieval with citations back to the exact clause
Quantized and deployed on-prem, so confidential files never leave the building

Dr. Sreedath Panat

MIT PhD · Vizuara AI Labs

Your Instructor

Dr. Sreedath Panat

MIT PhD · Co-founder & Director, Vizuara AI Labs

Dr. Sreedath holds a PhD from MIT and is the co-founder and director of Vizuara AI Labs. An IIT Madras graduate and department gold medalist, he has built a 200K+ subscriber YouTube channel and co-authored the Manning bestseller “Build DeepSeek from Scratch”.

His teaching philosophy: visual intuition first, mathematical rigour second, hands-on implementation always. Every concept is taught from scratch - no hand-waving.

Have questions? Reach out at sreedath@vizuara.com

All 8 core lectures personally delivered
PhD from MIT - rigorous technical foundation
IIT Madras graduate & department gold medalist
Winner of the Langmuir Award
Co-author of Manning bestseller "Build DeepSeek from Scratch"
200K+ YouTube subscribers · 115K+ LinkedIn followers

LinkedIn (115K+)YouTube (200K+)Google Scholar

Research Starter Kit

Start your research with a head start.

Don't start from scratch. Tell us your topic of interest and we'll generate a personalised research roadmap and an initial version of your research paper - delivered asynchronously, so you can hit the ground running from day one.

What's in the kit

Personalised Research Roadmap (PDF)

You tell us your topic of interest. We generate an 8-week structured plan with milestones, deliverables, and acceptance criteria - tailored to your specific SLM research area. Includes literature review scope, training pipeline design, experiment matrix, and manuscript timeline. Delivered asynchronously.

Initial Research Paper Draft

We generate an initial version of your research paper - research questions framed, methodology outlined, related work surveyed, and experiment setup defined. You don't start with a blank page - you start with a 6-8 page scaffold ready to build on. Delivered asynchronously based on your topic.

Curated Paper Reading List

12-15 handpicked papers relevant to your topic with reading order, key takeaways, and connections between papers. Includes a literature matrix template for systematic tracking.

Starter Code Template

A clean, documented codebase scaffold for your SLM research project - model loading, training loop, evaluation pipeline, and experiment config. Ready to run on day one.

Example research topics

Your roadmap is personalised to your background and goals. Here are some SLM topics our students have worked on:

Knowledge Distillation from Large to Small Language Models

Efficient Pre-training with Data Pruning and Curriculum Learning

Domain-Adaptive Fine-Tuning for Enterprise Legal/Medical SLMs

Quantization-Aware Training for Sub-1B Parameter Models

On-Device Language Models for Mobile and Edge Deployment

Retrieval-Augmented Generation with Small Language Models

Multi-Task SLMs for Enterprise Document Processing

Constitutional AI and Safety Alignment for Enterprise SLMs

08Pricing

Build your workshop

Select what you need. Everything adjusts instantly.

Step 1 — Choose your program

Step 2 — Or pick a bundle and save

Your Workshop

Select a program to get started.

Select a program to continue

What mentorship includes

Fully async - personalized feedback at every stage, no calls required. See our published research.

Target: Publishable Paper

The goal is a research paper. Your mentors guide you from topic selection through experiments to a publication-ready manuscript.

Every Step Guided

Literature review, experiment design, ablation studies, writing - your mentors walk you through every step of the research process so you never feel stuck.

Industry + Research Exposure

Get career strategy and deep research guidance. Both industry and academic perspectives in one mentorship.

Paper Reading Guidance

Curated reading lists, paper discussion, and feedback on how to extract and apply insights from the literature.

Actionable Next Steps

Every interaction ends with clear deliverables and deadlines. You always know exactly what to do next.

Ready to build your
enterprise SLM?

Join the 8-day bootcamp and go from zero to building, training, and deploying enterprise small language models from scratch.

Build an Enterprise SLM.From scratch, end to end.

Small models are the nextenterprise standard.

Enterprise SLM Market Size

Open-Source AI Unicorns & Funding

The people building AIare betting on small.

Built for engineers who want to go deep.

Leave production-ready.

The complete toolkit.One SLM bootcamp.

Transformer internals

Data curation and tokenization

Pretraining from scratch

Fine-tuning with LoRA and QLoRA

Preference tuning and guardrails

Evaluation and retrieval (RAG)

Inference engineering

Deployment and operations

The full training pipeline.End to end.

Raw Data

Clean & Format

Tokenizer

Embeddings

Transformer Block

Pre-training

SFT + RLHF

Inference

Raw Data

Clean & Format

Tokenizer

Embeddings

Transformer Block

Pre-training

SFT + RLHF

Inference

8 days of building.One complete education.

Transformer internals and the small-model case

Data, curation, and tokenizers

Pretraining from scratch

Fine-tuning: SFT, LoRA, and QLoRA

Preference tuning, alignment, and guardrails

Evaluation and retrieval (RAG)

Inference engineering

Deployment, operations, and capstone

The tools that powerproduction SLMs.

Build somethingyou can actually ship.

A confidential legal contract assistant

Dr. Sreedath Panat

Start your research with a head start.

What's in the kit

Example research topics

Build your workshop

Full Bootcamp (8 Days)

Research Starter Kit

1:1 Mentorship (3 Months, Async)

Full Bootcamp + Research Starter

Full Bootcamp + 1:1 Mentorship

Research Starter + 1:1 Mentorship

Full Bootcamp + Research Starter + 1:1 Mentorship

Your Workshop

What mentorship includes

Target: Publishable Paper

Every Step Guided

Industry + Research Exposure

Paper Reading Guidance

Actionable Next Steps

Ready to build yourenterprise SLM?

Common questions.

About the Bootcamp

Research Starter Kit

1:1 Research Mentorship

After the Bootcamp

Build an Enterprise SLM.
From scratch, end to end.

Small models are the next
enterprise standard.

The people building AI
are betting on small.

The complete toolkit.
One SLM bootcamp.

The full training pipeline.
End to end.

8 days of building.
One complete education.

The tools that power
production SLMs.

Build something
you can actually ship.

Ready to build your
enterprise SLM?