TECHNICAL BLOG
Deep Dives for Engineers
Detailed technical articles covering the real problems we solve in embedded systems, AI, and robotics engineering.
Detailed technical articles covering the real problems we solve in embedded systems, AI, and robotics engineering.
How to fit inference into kilobytes: quantization, feature extraction, and deployment constraints on microcontrollers.
On MCUs, you rarely deploy a raw neural network alone. You deploy a pipeline: feature extraction + tiny model.
Quantization (int8) is the default, but it’s only part of the story—memory layout, operator support, and IO timing dominate real performance.
We recommend starting with a constrained target (SRAM/flash budget), then designing the model to fit—rather than trimming after the fact.
Measure energy per inference, not just accuracy; battery life is often the real KPI.
Continue reading — handpicked articles you might enjoy