The Challenge
A consumer electronics company developing next-generation smart wearables and hearables needed a custom AI accelerator that could run sophisticated ML models continuously while maintaining week-long battery life on a coin cell battery.
Our Solution
We designed a fully custom RISC-V based SoC with an integrated neural processing unit (NPU) optimized for TinyML workloads, featuring aggressive power gating, in-memory computing elements, and a flexible dataflow architecture.
System Architecture
Heterogeneous architecture combining a RISC-V application processor with custom neural accelerator blocks and comprehensive power management.
Chip Specifications
| Process Node | 12nm FinFET (TSMC) |
| Die Size | 9mm² (3x3mm) |
| Package | WLCSP 4x4mm, 81 balls |
| NPU Performance | 1 TOPS @ 100MHz |
| Power Efficiency | 10 TOPS/W (INT8) |
| Always-On Power | < 5mW (voice wake + basic inference) |
| Deep Sleep | < 1µA with RTC |
Software Stack
- Custom LLVM toolchain with RISC-V extensions
- Lightweight RTOS optimized for power management
- TensorFlow Lite Micro with custom kernels
- Model compiler with quantization support
- Power-aware scheduling runtime
- Secure OTA update mechanism
- HAL with power state management APIs
TinyML Model Support
The NPU architecture was optimized for common TinyML workloads while maintaining flexibility for model updates.
Keyword Spotting
DS-CNN (Depthwise Separable CNN)
96% accuracy on custom vocabulary
8ms inference, <3mW power
Voice Activity Detection
RNN-based classifier
98% detection accuracy
Always-on at 0.8mW
Person Detection
MobileNetV3-Small variant
92% accuracy at 96x96 resolution
45ms inference, <15mW power
Gesture Recognition
1D CNN on accelerometer data
94% on 12 gesture classes
5ms inference, <1mW power
Sensor Fusion
Multi-input neural network
Activity recognition, context awareness
Continuous at 2mW
Implementation Timeline
Results & Impact
The custom RISC-V AI accelerator exceeded all specifications, enabling a new category of always-on intelligent devices with week-long battery life and sophisticated on-device AI capabilities.
Energy Efficiency
Always-On Power
Inference Latency
BOM Cost
PCB Area
Licensing Costs
Return on Investment
Implementation Cost
Multi-year silicon development investment
Annual Savings
Payback Period
5-Year ROI
“Rapid Circuitry delivered exactly what we needed - a custom AI chip that lets us differentiate in a crowded market. The 10x efficiency improvement enabled features our competitors simply cannot match. Our devices now have always-on AI with week-long battery life, and we own the IP completely.”
CTO
Client Consumer Electronics Company
Technologies Used
Awards & Recognition
RISC-V Summit Innovation Award 2025
Best Commercial RISC-V Implementation
Embedded Computing Design Award
Most Innovative AI Processor
IEEE Solid-State Circuits Best Demo
Ultra-Low-Power AI Accelerator