SpliDT - Scaling Stateful Decision Tree Algorithms in P4

Posted Sep 9, 2025

By P4 lang

6 min read

SpliDT: Scaling Stateful Decision Tree Algorithms in P4

Google Summer of Code 2025 Final Report

Organization: P4 Language Consortium
Contributor: Sankalp Jha
Mentors: Murayyiam Parvez, Annus Zulfiqar, Ali Imran, Davide Scano, Muhammad Shahbaz
Project Repository: SpliDT Codebase

Project Overview
Project Goals
- Core Framework
- Production-Ready Components
Implementation Details
- Project Architecture
- Repository Structure
Future Scope
Conclusion
References

Project Overview

SpliDT is a switch-native compiler framework that enables stateful decision tree inference directly in programmable switches, bringing real-time machine learning into the network data plane. SpliDT compiles high-performance decision tree models to enable detection and observability of security-significant flow behaviors across diverse traffic workloads.

A major challenge in deploying decision trees in this environment is the limited stateful memory of ASIC chips, which makes it impossible to store multiple packet features simultaneously. SpliDT solves the issue with Partitioned Decision Trees (PDTs). Instead of evaluating all features simultaneously, the tree is split into smaller subtrees, each handling only top k-features at a time. Flows are guided across subtrees using Subtree IDs (SIDs), ensuring that all features are eventually considered without exceeding hardware limits. This design reduces memory usage, removes latency overheads, and maintains classification accuracy, while making the system scalable and efficient.

By combining P4-based dataplane logic with a lightweight control plane, SpliDT provides a practical and extensible framework for developers and researchers working on in-network ML, traffic classification, and real-time security detection.

Project Goals

Core Framework

Stateful P4 Implementation: Built complete decision tree classifier with SID-based traversal, recirculation logic, and multi-stage processing
Dynamic Controller System: Developed P4Runtime and Barefoot Runtime integration supporting installation of control plane rules for partitioned models, graceful error handling
Automated Code Generation: Created Jinja2-based template system generating complete P4 programs from user-given partitioned DT models
Hardware Validation: Successfully deployed and tested on Intel Tofino Model and BMv2 software targets

Production-Ready Components

Component	Status	Functionality
P4 Data Plane	Completed	Stateful classification, SID management, packet recirculation
P4Runtime/BftRuntime Controller	Completed	Custom-DT Model loading, rule installation
P4 Code Generation Framework	Completed	Automated P4 generation from ML models
Testing Infrastructure	Completed	Mininet simulation, packet verification
Deployment Automation	Completed	Makefile workflow for reproducible deployments

Implementation Details

Project Architecture

The SpliDT framework implements a complete model-to-deployment pipeline that transforms network datasets into hardware-optimized decision tree inference running on programmable switches. The architecture bridges machine learning model training with P4-based data plane deployment through automated code generation and runtime management.

1. Model Compilation (SpliDT Compiler)

Repository Location: dt-framework/ + custom_dts/

The SpliDT Compiler processes raw network datasets and produces optimal partitioned decision tree models:

Input: Dataset, target objectives, performance constraints
Components:
- CICFlowMeter: Extracts bidirectional flow features from PCAP files
- HyperMapper: Automated hyperparameter optimization for tree partitioning
- Grafana + Postgres: Performance monitoring and dataset analysis
Training Process: Uses design search exploration and feasibility testing to determine optimal subtree partitions Output: Partitioned decision tree model as JSON/DOT files + corresponding pickle files
Key Innovation: The compiler automatically determines how to split decision trees into SID-based subtrees that fit within ASIC memory constraints while maintaining classification accuracy.

2. Code Generation and Standardization (SpliDT Generator)

Repository Location: utility/

The SpliDT Generator transforms trained models into deployable P4 programs:

Input Processing:

utility/filter/: Processes decision tree models to generate files that map the required stateful features in the P4 program to their corresponding operations (sum, min, max)
utility/netbeacon/: Converts decision tree models into TCAM rules (inspired by NetBeacon [1] )

Code Generation:

utility/p4codegen/: Jinja2-based P4 generator that creates complete P4 programs from model inputs

Outputs:

P4 Program: Complete data plane implementation with SID-based stateful processing
Controller Code: P4Runtime client for dynamic rule installation
Configuration Files: Mapping between model features and P4 metadata fields
3. Runtime Deployment (Control + Data Plane)
Repository Location: dataplane_driver/

The Runtime Deployment stage compiles and deploys the generated code:

Compilation Pipeline:
- P4 Compiler: Processes generated P4 program → produces target binary
- .p4info Generation: Creates P4Runtime interface definitions
- Target Driver: Intel Tofino(switch) or BMv2(mininet) software switch initialization
Control Plane Operation:
- P4/Bft Runtime Client: Installs the TCAM Rules of each subtree by matching on the Subtree ID(SID)
- gRPC Communication: Installs match-action table entries via P4Runtime protocol
Data Plane Execution:
- Stateful Processing: Packets processed through SID-based subtree traversal
- Feature Extraction: Headers parsed into metadata fields f1, f2, f3, sid
- Classification: Partitioned decision-tree inference with recirculation
- Result Output: Classification results via digest emission

Repository Structure

Future Scope

Ansible-based Deployment: Ansible playbooks for automating environment setup, model deployment, controller startup.
MoonGen Traffic Generation Integration: Enable 100 Gbps stress testing with realistic traffic patterns for comprehensive performance validation
Decision Tree Pipeline Integration: As future work, the optimal partitioned models generated using HyperMapper to maximize both accuracy and supported flows will be translated into P4. In the current prototype, we assume the model is provided by the user.

Conclusion

This GSoC journey gave me valuable research experience in the field of computer science and networking within an open-source community. More specifically, the SpliDT project allowed me to improve my understanding of P4, while also exploring network-programmable devices and addressing real research challenges, such as implementing decision tree inference directly in programmable switches.

Beyond the technical work, I also had the opportunity to engage with the P4 Language Consortium Community by participating in community activities and contributing to my project. This experience inspired me to write a blog to share what I learned during Google Summer of Code.

The friendly tech savvy, tech-savvy mentors really helped me nurture a lot along with the project, and I would like to thank them again.

With such a colorful time over the past few months, I am truly grateful to everyone for believing in me and making things very real and possible!
Thanks a lot with deepest regards!

References

[1] NetBeacon Project: IDP-code/NetBeacon
Guangmeng Zhou, Zhuotao Liu, Chuanpu Fu, Qi Li, and Ke Xu. An Efficient Design of Intelligent Network Data Plane.
In 32nd USENIX Security Symposium (USENIX Security 23), pages 6203–6220, Anaheim, CA, August 2023. USENIX Association.

GSOC 2025

gsoc2025

This post is licensed under CC BY 4.0 by the author.

SpliDT - Scaling Stateful Decision Tree Algorithms in P4

SpliDT: Scaling Stateful Decision Tree Algorithms in P4

Google Summer of Code 2025 Final Report

Table of Contents

Project Overview

Project Goals

Core Framework

Production-Ready Components

Implementation Details

Project Architecture

1. Model Compilation (SpliDT Compiler)

2. Code Generation and Standardization (SpliDT Generator)

3. Runtime Deployment (Control + Data Plane)

Repository Structure

Future Scope

Conclusion

References

Trending Tags