NVIDIA Track | MLSys 2026 FlashInfer AI Kernel Generation Contest

Contest Overview

🎯

The Challenge

Create AI agents that automatically generate optimized CUDA kernels for cutting-edge LLM operations. Your agent will receive kernel specifications and must produce high-performance code for NVIDIA Blackwell B200 GPUs.

📊

Benchmark

Compete across workloads derived from production models. Kernels are evaluated on correctness, speed, and win rate against FlashInfer baselines.
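The official scoring rules are not yet published, so the following is only an illustrative sketch of how speed and win rate against a baseline could be computed. Everything here is an assumption: the `WorkloadResult` fields, the workload names, the latency numbers, and the rule that an incorrect kernel counts as a loss. None of it comes from the contest or from FlashInfer-Bench.

```python
from dataclasses import dataclass


@dataclass
class WorkloadResult:
    """Outcome of one workload. All field names are hypothetical."""
    name: str
    correct: bool         # did the kernel pass the correctness check?
    candidate_ms: float   # latency of the submitted kernel
    baseline_ms: float    # latency of the baseline kernel


def speedup(r: WorkloadResult) -> float:
    """Baseline latency divided by candidate latency; 0.0 if incorrect."""
    if not r.correct or r.candidate_ms <= 0:
        return 0.0
    return r.baseline_ms / r.candidate_ms


def win_rate(results: list[WorkloadResult]) -> float:
    """Fraction of workloads where a correct kernel beats the baseline."""
    if not results:
        return 0.0
    wins = sum(1 for r in results if speedup(r) > 1.0)
    return wins / len(results)


# Made-up numbers, for illustration only.
results = [
    WorkloadResult("moe_small", True, 0.8, 1.0),
    WorkloadResult("moe_large", True, 1.2, 1.0),
    WorkloadResult("attn_long", False, 0.5, 1.0),
    WorkloadResult("gdn_decode", True, 0.5, 1.0),
]
print(win_rate(results))
```

In this sketch correctness gates everything: a fast but incorrect kernel contributes a speedup of zero and can never count as a win. The actual thresholds and ranking method may differ once the Evaluation details are released.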

Platform

Submit and evaluate your kernels on FlashInfer-Bench (bench.flashinfer.ai).

🤖

Two Approaches

We welcome both expert-crafted seed kernels refined through agent-assisted evolution and fully agent-generated solutions. The two approaches will be evaluated separately. Teams submitting agent-generated solutions must open-source the scripts needed to reproduce their kernels. API credits are not provided.

Competition Tracks

Three kernel categories targeting the most important operations in modern LLMs

Track A

Fused MoE

Fused Mixture-of-Experts kernels with FP8 support.

Track B

Sparse Attention

DeepSeek Sparse Attention from DeepSeek V3.2.

Track C

Gated Delta Net

Gated Delta Net, as used in Qwen3-Next.

Getting Started

Everything you need to start competing

📦

Starter Kit

Development environment setup and test/benchmark scripts.

View Starter Kit

📋

Submission Format

Agent interface specifications and kernel solution requirements.

Coming Soon

🎯

Evaluation

Scoring metrics, correctness thresholds, and ranking methodology.

Coming Soon

📖

Baselines

FlashInfer production kernels and OpenEvolve-based references.

Coming Soon

Timeline

Jan 22, 2026

Public Launch

Registration opens
Starter kit released

Feb 9, 2026

Baselines Released

OpenEvolve-based baselines available

Feb 15, 2026

Registration Deadline

Last day to register your team

Apr 24, 2026

Kernel Submission Deadline

11:59 PM AoE

May 1, 2026

Writeup Deadline

Technical report due (max 4 pages)
11:59 PM AoE

May 11, 2026

Winners Notified

Results announced via email

May 17-22, 2026

MLSys 2026 Award Ceremony

Bellevue, WA
Winners present their solutions

Prizes & Resources

🏆

GPU Prizes

GPU cards for top-performing teams. Details coming soon.

🎫

Free Registration

Winners receive complimentary MLSys 2026 conference registration.

💻

GPU Access

Registered teams receive Modal compute credits for NVIDIA B200 GPU development.

Ready to Compete?

Join teams from around the world in pushing the boundaries of AI kernel generation.

Register Your Team

Registration deadline: February 15, 2026

Resources

📦 FlashInfer GitHub
📊 FlashInfer-Bench
🏛 MLSys 2026
📝 DeepSeek V3 Paper
📝 Gated Delta Net Paper