What is Kyttar?
Kyttar is a massively parallel asynchronous processor designed for real-time signal processing. It is a 2D grid of 120 identical processing cells with no clock, no flip-flops, and no central controller. Each cell has 32 words of memory, a small ALU, and communicates with its four neighbors through WRITE and JUMP instructions. Data flows through the array the way chemical signals flow through mycelium networks: locally, in parallel, without any central coordination.
The architecture uses 4-phase bundled-data handshaking with Muller C-elements, matched delay chains, and latches instead of flip-flops. Everything is self-timed. There is no clock tree, no clock domain crossing, no CTS. The cells coordinate entirely through local handshake signals.
Why?
The target application is software-defined radio. SDR workloads are inherently parallel: dozens or hundreds of channels need to be demodulated, filtered, equalized, and decoded simultaneously in real time. CPUs process these sequentially. FPGAs can do it in parallel but require hardware design expertise. Kyttar is designed to give you the parallelism of an FPGA with the programmability of a processor. You write signal processing algorithms in simple Q15 fixed-point assembly. No Verilog, no timing constraints, no clock domain crossings. The hardware handles all of that.
A single Kyttar chip can run multiple independent signal processing chains simultaneously on different groups of cells. Demodulators, filters, equalizers, vocoders, decoders, all running at wire speed.
The Architecture
At the heart of every async pipeline stage is a Muller C-element, the foundational building block of asynchronous computing. A C-element outputs high when all inputs are high, low when all inputs are low, and holds its previous state when the inputs disagree:
| A | B | OUT | |---|---|-----| | 0 | 0 | 0 | Both low: output low | 0 | 1 | OUT | Disagree: hold | 1 | 0 | OUT | Disagree: hold | 1 | 1 | 1 | Both high: output high
C-elements control the latches. A request signal propagates forward through a matched delay element to the next stage, and an acknowledge signal comes back. The delay element guarantees the data is stable before the next latch captures it. It looks similar to a synchronous pipeline (sequential elements with combinational logic between them), except instead of a master clock, each stage coordinates only with its immediate neighbors.
The Chip
The first Kyttar chip is a 120-cell array fabricated on the SkyWater SKY130 130nm process through ChipFoundry's ChipIgnite program. The die sits inside an OpenFrame pad ring with 44 GPIOs, power/ground pads, and ESD protection. Not having to deal with the pad ring and ESD is a HUGE win. You can easily sink months into sorting that mess out, so kudos to the ChipFoundry guys for supplying this. Total cost for 100 packaged chips: roughly $15,000. Obviously, this is a test shuttle, so the price per die is very high. On this process node, with volume, production prices will drop by a couple orders of magnitude.
Status: Tapeout May 13th 2026. Packaged parts expected October 2026.
The Flow
The entire RTL-to-GDSII flow is open source:
- Yosys for synthesis
- OpenROAD for place and route
- Magic for custom layout, DRC, and parasitic extraction
- ngspice for SPICE simulation
- Netgen for LVS
- Icarus Verilog for functional simulation
- CVC for SDF annotated gate level simulation
- LibreLane for flow orchestration
- ChipFoundry's cf wrapper for project-level build management
No commercial EDA licenses or six-figure tool subscriptions. All you need is a Linux machine and the willingness to learn. Are these tools as good as the tools from the big 3 tool vendors? No, but they do a pretty good job, especially at this price point. Also, everything is bundled for you in LibreLane, so you just have to deal with configuration files, not the entire tool chain bring up, which by itself is a huge win for a startup.
Ham Radio Applications
I'm a licensed Extra Class ham (K9VIB). I've been fascinated by the concept of cognitive radio for many years. While this chip won't have the horsepower to enable this (yet), the architecture will scale to enable full spectrum awareness. Scaled up versions will be developed after the hardware is proven out. That said, this chip was designed with the amateur radio community in mind:
- MIL-STD-188-110D HF modem implementation
- Parallel demodulation of multiple channels simultaneously
- Real-time digital voice processing (FreeDV, D-Star, DMR, M17, etc.)
- AFSK1200 / AX.25 packet radio processing
- Multi-mode decoding: run several demodulators at once and auto-detect the mode
- Wideband channelizer: split a chunk of spectrum across the cell array
My envisioned end application for this space is an SDR board that you can plug into your radio like any existing SDR, but the SDR itself would handle all of the heavy lifting, not your PC. This provides a much richer DSP experience while also enabling use of DSP heavy modes on something as simple as a Raspberry Pi or Arduino. Developing the SDR dev board will be my next priority after the chip tapes out. The first demo I plan to build is a full MIL-SPEC-188-110D modem.
Software
The software stack will be open source:
- GnuRadio blocks for integration with existing SDR workflows
- Cycle-accurate simulator Rust based for speed with Python bindings for development and testing. Also used as the predictor model for hardware verification
- DSP block library with validated signal processing blocks (20+ blocks including AGC, FIR filters, FM/AM/SSB demodulators, NCO, DFE)
- Assembler for the Q15 fixed-point ISA
The chip is programmed in assembly. Each cell runs a small program (fits in 32 words). No hardware design experience needed to write signal processing algorithms. If you can write a DSP algorithm in C or Python, you can write one for Kyttar. And quite honestly, the ISA and architecture is simple enough that AI agents like Claude and friends can pretty well write up the DSP algorithms for you in very little time.
Blog Series
I'm documenting the entire journey from RTL to silicon in a weekly blog series on the Lattrex website:
- Week 1: The Open-Source Chip Toolchain
- Week 2: LibreLane - The Flow That Ties It All Together
- Week 3: Synthesis Without a Clock
- Week 4: Timing Constraints Without a Clock
- Week 5: The Custom Cell Detour
New posts every week. More coming on verification, SPICE simulation, and silicon bring-up once the chips arrive.
About
Kyttar is built by Lattrex Inc., a one-person semiconductor startup in Oak Ridge, Tennessee. I spent 15 years building and verifying chips at Microchip Technology before starting Lattrex to build this processor.
Website: lattrex.com
Chuck McClish