Robot hardware
- cortex m4f mcu, 72MHz, stm32f303
- motor driver TI drv8834
- pololu HP motors 1:30 with magnetic encoders
- pololu 28mm tyres, high adhesion
- IMU (gyro + accelerometer) LSM6DS0
- 8 line sensors sensitive for green light (540nm), 3 IR obstacle sensors
- 180mAh LiPol 2S acu
- USB programming via bootloader
data:image/s3,"s3://crabby-images/5e117/5e117e192f62c0664a4783bb5e7531fde30c4f46" alt=""
opengl debug application
I created user friendly debug application, showing :
- line sensor raw data
- motors status - speed, encoders
- IMU
- neural network classification process
- raw data from USART
data:image/s3,"s3://crabby-images/edb93/edb93be4e7c39bfee1a7e6192cf2061b8c63c8ac" alt=""
Robot software is based on this key points :
- quadratic interpolation : high precission line position computing
- main loop running on 200Hz
- steering PID : PD controller for steering
- motor PID : two PIDs for motors speed controll
- curve shape prediction : run on straight line, brake on curve; deep neural network
- written in C++
- network training on GPU - own framework for CNN
data:image/s3,"s3://crabby-images/fcc86/fcc8652de431920272b1d8966940fa1cbf7322c6" alt=""
Neural network for line shape prediction
- fast run on straight line, brake on curve
- neural network for line type classification, DenseNet - densely connected convolutional neural network
- input 8x8 matrix raw data from line sensors, 8 past line positions from 8 sensors
- output 5 curves types (two right, two left, one straight
Line prediction idea from last 8 line positions is on following figure
data:image/s3,"s3://crabby-images/c36de/c36def73e1c0f0b7ea39dc30210606bd1db2e871" alt=""
Line dataset was made as artificially generated data,
- 25000 for training
- 5000 for testing
- augmentation - luma noise, white noise
Network architecture
I was fighting with robustness against network speed. Network have to run 200Hz.
I decide to use DenseNet, which is very computation effective - in layers you need less kernels than in pure CNN.
data:image/s3,"s3://crabby-images/2d15d/2d15d2b16a14ee111dcfb5b2fad3c3ecbd23fede" alt=""
Final network architecture is :
which is able to achieve 95% classification accuracy
data:image/s3,"s3://crabby-images/b41a2/b41a24873f75847adde2554c572f3bbbae2fb9ab" alt=""
layer | layer type | input feature map tensor size |
0 | convolution 3x3x4 | 8x8x1 |
1 | max pooling 2x2 | 8x8x4 |
2 | dense convolution 3x3x4 | 4x4x4 |
3 | dense convolution
3x3x4 | 4x4x8 |
4 | full connected 5 | 4x4x12 |
Fitting network into embedded
- convert float weights to int8_t
- scale weights into 8bit range
- use double buffer memory trick - to save memory, use only two common buffers for all layers
data:image/s3,"s3://crabby-images/0ea6d/0ea6da44cb871ef2a69debe2ed56f93dc639a89b" alt=""
how did you train the robot? and why the IMU?