-
Hack Chat Transcript, Part 2
12/01/2021 at 21:03 • 0 comments
Just like trained safety personnel can look out for.
To tag on to Dan's question: are there use cases being discussed/debated that are not centered on human tasks/automation? I don't have examples or an inkling of what other cases Spatial AI could be used for.
So maybe an advanced security system that, instead of sensing doors or windows opening and closing with switches, watches for, say, someone climbing in through a window.
For example, fall detection, which can be privacy-preserving.
Yes, good example Dan.
@Erik - can you find the machine safety demo and share it here too?
https://github.com/luxonis/depthai-experiments/tree/master/gen2-human-machine-safety#gen2-human-machine-safety
Or to detect when someone is too close to an operating robot. Spatial AI can detect where the robot hand is and where the person is, and measure the distance between them; demo here:
Thanks!
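The core of that check is just the Euclidean distance between two 3D detections. A minimal Python sketch, with made-up coordinates and a made-up threshold rather than anything from the actual demo:

import math

def too_close(hand_xyz, person_xyz, threshold_mm=500.0):
    # True if the detected robot hand and person are closer than threshold_mm.
    # Inputs are (x, y, z) camera-space coordinates in millimeters,
    # e.g. taken from a spatial detection network's output.
    dx, dy, dz = (a - b for a, b in zip(hand_xyz, person_xyz))
    return math.sqrt(dx * dx + dy * dy + dz * dz) < threshold_mm

# Hypothetical detections, in mm:
print(too_close((120.0, -40.0, 900.0), (400.0, 10.0, 1200.0)))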
@ump - yes. There are all sorts of applications which are not about replacing humans.
And many of the problems are just unsolved problems, which were thought unsolvable.
One good example is having OAK-based sensors on fishing nets.
Or produce Gcode on the fly for robotic arms
To make a smart fishing net.
To prevent bycatch. Which is just unsolved now.
The only way bycatch is discovered is by bringing up the net - and discovering (with horror) all the dead endangered species that are in it.
Is that catching unintended species?
Yes.
With Spatial AI, you can get the species, size, and locations of all of them. Underwater, and communicated back to the ship at 56k-modem-like speeds (through sound transduction).
So you can just pull the nets or move them when species that are too small (common problem) or are endangered (or both) are present.
I am working on "spinal cord injury" currently. You need rigged 3D models of the person for exoskeletons. You need 3D imaging real time of nerve activity. You need 3D monitoring of muscles. If you use the existing muscles by stimulation you need the models of the target area, the fields used, and careful planning to keep a unique human balanced. It is almost all 3D.
Cool applications! Thanks Brandon
There have to be significant problems with turbidity and general poor-visibility conditions underwater in most places. At least I'd imagine so
@ump :
Thanks! A bunch more applications are in the materials below as well:
https://opencv.org/opencv-ai-competition-2021/
https://www.kickstarter.com/projects/opencv/opencv-ai-kit-oak-depth-camera-4k-cv-edge-object-detection/description
Great questions!
thought. megahertz-range ultrasound transducer array. augment the vision with 3d sonar.
@Dan Maloney - yes, the built-in edge filter helps with turbidity.
Here is IPOZ using it in high turbidity on OAK-FFC-3P.
@RichardCollins that's super cool. 100% agree.
And @RichardCollins our community has created a few great projects that use our devices for 3D pose estimation, e.g. this one: https://github.com/geaxgx/depthai_blazepose#inferred-3d-vs-measured-3d
@Thomas Shaddack yes love that idea. IPOZ is doing similar with Sonar underwater and fusing them.
Search "IPOZ" on here:
OpenCV AI Kit - Lite (and Tiny)
We need to get that video uploaded to Youtube separately... for now I think that's the only place it is on the internet.
Ah, cool. I'd imagine other wavelengths of light might help too.
Yes. IIRC 400nm is the best wavelength.
And we have cameras that work there.
What about extending it to panorama-like arrays? overlapping fields of vision, and we could get 360-degree spatial awareness around the vehicle.
400nm is best for underwater, IIRC. There is a dip in absorption there.
Oh thanks Erik~
If you're looking for underwater rangefinders, we actually have some stuff in the pipeline, feel free to reach out to myself or Matteo. You're in the right ballpark for laser.
What's the difference between the Oak-D and Oak-D-Lite?
@Thomas Shaddack - yes, we have several customers who are doing panorama overlap. Project NorthStar is one and open source for XR/AR.
@Ethan Waldo here we explain it in detail: https://docs.luxonis.com/projects/hardware/en/latest/pages/DM9095.html#oak-d-vs-oak-d-lite
thanks
Using OAK-FFC-4P
random thought for cameras. use reflective optics instead of lenses. then we don't rely on the air-water refraction index difference, and can flood the entire assembly with optical silicone oil/gel. voila, no crushable hollow spaces, and no pressure hull needed. electronics (without hollow components like crystal cans...) can withstand immense hydrostatic pressures. trick used for stuff outside of the inner pressure hull of submarines.
Hackaday.IO could be an incubator and work with all other similar and related efforts on the Internet. Sorry to interrupt. I am reading what you are talking about and trying to note where it can be used. Imagine keeping market studies for every idea in the world, along with competitive and business intelligence reports.
Thanks Eric and Brandon. Besides spinal cord injury there are millions needing assistance because of various kinds of paralysis. For the Internet Foundation I map global groups and then try to put the pieces together. (("spinal cord" "injury") OR "paralysis") has 40.8 million entry points. I found that having a technology is pretty much useless unless you map the people working on it, who needs it, and all the subsidiary groups and people affected. So I am listening to you talking and seeing all the groups trying to work on the same technologies.
@Thomas Shaddack I believe IPOZ are using really thin quartz glass in front of the cameras and have the cameras flush to the surface of the glass (to minimize refraction)
you can get lensless cameras that use a fancy pattern rather than, say, a pinhole/zone plate. i can't seem to find them at the moment though
@RichardCollins in a sense it already is, bringing the opensource community together:)
@anfractuosity how would stereo disparity work with such a camera?
https://www.youtube.com/watch?v=XS1FtWqXWQk
This might be of interest too.
And agreed, great idea!
We're about an hour in now, which is where we usually like to give the host(s) a chance to bail and get back to work. So I'll say my official thank you to Erik and Brandon for stopping by today, and to everyone for the interesting discussion. Food for thought for me -- I can think of a couple of different applications for spatial AI that I'd like to try.
@Erik just found the paper https://arxiv.org/abs/1509.00116 - I'm not sure, I'm afraid
Thanks for coming. I got a pretty good perspective on other people's hardware, and that's always good to have!
Erik, "open source" OR "opensource" is 201 Million entry points. It is NOT an integrated whole on the Internet and in the real world. I know the reasons why and can bypass and map from the outside. But getting groups so they actually collaborate to create real results for real people and needs, is a lot different than a bunch of people just talking about the same thing. Thanks for the links.
Thank you all!
Thanks for the great ideas and methods!
Thanks everyone. I'll post a transcript in a few minutes, in case you missed any links.

-
Hack Chat Transcript, Part 1
12/01/2021 at 21:02 • 0 comments
Greetings all, welcome to the penultimate Hack Chat of 2021! I'm Dan, and as usual I'll be modding today along with Dusan as we welcome Erik Kokalj to the chat. Erik works on spatial AI at Luxonis, and we're going to go into depth about spatial AI and CV.
Sorry, I had to...
Erik, are you out there yet?
Hello and welcome!
yay!
Hello everyone!
hi
Hey, welcome, great to have you today. Can you tell us a little about yourself, and maybe how you came to be working in machine vision?
Sure, so I am a software engineer with an electrical engineering background. I come from Slovenia, Europe, and I am an opensource enthusiast.
I started working at Luxonis about a year ago. Prior to that I didn't have much machine vision experience, and at Luxonis I started working on examples, tutorials, technical documentation, technical support, etc.
So I learned most of it over time doing demos/experiments
Sounds like the perfect way to learn, at least for me -- sink or swim, kinda.
So I've got a starter question: when you say "spatial AI", is that really just what a Kinect or some similar depth sensor does? Or is there more to it than that?
yes, exactly:) A lot of "hard" technical stuff is also abstracted by our library, so me, working on demos, didn't need that much machine vision experience
yes, it's very similar to Kinect. So TL;DR it's combining depth perception + AI, which can be used extensively across many fields
just copying some use-cases:
- Visual assistance (for visually impaired, or for aiding in fork-lift operation, etc.)
- Aerial / subsea drones (fault detection, AI-based guidance/detection/routing)
- E-scooter & micromobility (not allowing folks to ride rented e-scooters like jerks)
- Cargo/transport/autonomy (fullness, status, navigation, hazard avoidance)
- Sports monitoring (automatically losslessly zooming in on action)
- Smart agriculture (e.g guiding lasers to kill weeds, pests, or targeting watering)
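For a rough idea of what "depth perception + AI" looks like in code, here is a minimal sketch of a DepthAI spatial-detection pipeline in Python, paraphrased from the public examples; exact node and method names can differ between library versions, and the model blob path is just a placeholder:

import depthai as dai

# Build the pipeline: color camera + stereo depth + spatial detection network
pipeline = dai.Pipeline()

cam = pipeline.create(dai.node.ColorCamera)
cam.setPreviewSize(300, 300)          # NN input size for a MobileNet-SSD blob
cam.setInterleaved(False)

mono_left = pipeline.create(dai.node.MonoCamera)
mono_right = pipeline.create(dai.node.MonoCamera)
mono_left.setBoardSocket(dai.CameraBoardSocket.LEFT)
mono_right.setBoardSocket(dai.CameraBoardSocket.RIGHT)

stereo = pipeline.create(dai.node.StereoDepth)
mono_left.out.link(stereo.left)
mono_right.out.link(stereo.right)

nn = pipeline.create(dai.node.MobileNetSpatialDetectionNetwork)
nn.setBlobPath("mobilenet-ssd.blob")  # placeholder path to a compiled model
nn.setConfidenceThreshold(0.5)
cam.preview.link(nn.input)
stereo.depth.link(nn.inputDepth)

xout = pipeline.create(dai.node.XLinkOut)
xout.setStreamName("detections")
nn.out.link(xout.input)

# Run on the device and print each detection with its XYZ position (mm)
with dai.Device(pipeline) as device:
    q = device.getOutputQueue(name="detections", maxSize=4, blocking=False)
    while True:
        for det in q.get().detections:
            c = det.spatialCoordinates
            print(det.label, det.confidence, c.x, c.y, c.z)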
I'm very interested in what the state of the art hardware-wise is on the open source side there.
I guess that's where my confusion comes from, really -- there seems like so much you can do with "plain old CV" that doesn't need depth detection. But then again, depth really opens up some interesting doors. Add in the AI component, and it seems really powerful.
@riley.august most of our baseboards are opensource, at least all of those that don't have the Myriad X (Intel's VPU) on them
Ooh. I'll have a look, it's nice to see other companies contributing back to the maker community like that. Depth detection does take a lot of the guesswork out of interpreting a 2D image.
thought. light field cameras. highly processing-intensive but gives intrinsic 3d image.
yes, exactly:)
And disparity depth is most similar to human vision.
And like human vision, it works in all sorts of conditions.
Whereas structured light, etc. can self-interfere, have lighting limitations (may not work in sunlight) etc.
Whereas disparity depth is passive. Largely works in similar conditions to our heads. :-)
@Dan Maloney yes, true, eg. speed estimation, distance between 2 objects, or just to know where something is (for robots)
"Structured light" -- is that like lidar or something different?
how does it perform on specular surfaces?
@Dan Maloney it's active stereo, so usually there's an IR laser (either a dot projector or lines) so disparity depth can be more accurate, which is especially useful for low-texture surfaces (where there aren't many features for disparity matching, e.g. a wall or floor)
Gotcha
the rotating table 3d scanners with a line laser projected onto an object are a rudimentary kind of that. with structured light there are more known-shape lines (or dots) and the object doesn't have to rotate.
@charliex (googling what that means)
can i use that as a lidar for short distances (2 meters) ?
specular reflection, so like shiny objects
Isn't that highly reflective surfaces? Like a mirror?
Yes, OAK-D will produce depth at 2 meter range.
ah, reflective. I'm quite sure it wouldn't work
i saw datasheets for time-of-flight camera sensors. 320x240 or 640x480 i think. pretty amazing resolution.
lightfield is said to work even on reflective stuff.
Stereo neural inference is what should be used for shiny objects
https://github.com/luxonis/depthai-experiments#stereo-neural-inference-results-visualizer-here
GitHub - luxonis/depthai-experiments: Experimental projects we've done with DepthAI.
It uses AI to locate the object of interest.
@Thomas Shaddack same principles apply, a lightfield camera still suffers for specular capture if it's using standard imaging sensors
For example the object of interest could be a mirror or mirror balls or whatever.
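Roughly, stereo neural inference means running the network on the left and right rectified frames separately and triangulating the matched detections yourself, so depth doesn't rely on disparity matching on the shiny surface itself. A toy back-projection sketch, with assumed (OAK-D-ish, not official) camera parameters:

def triangulate(x_left, x_right, y, fx=450.0, cx=320.0, cy=200.0, baseline_mm=75.0):
    # Back-project a feature seen at pixel column x_left / x_right (same row y)
    # in the rectified left/right images into camera space, in millimeters.
    # fx, cx, cy and the baseline are illustrative values, not official specs.
    disparity = x_left - x_right          # pixels
    z = fx * baseline_mm / disparity      # depth in mm
    x = (x_left - cx) * z / fx
    y3d = (y - cy) * z / fx               # assumes fx ~= fy
    return x, y3d, z

# e.g. the network finds the same keypoint at column 400 (left) and 380 (right):
print(triangulate(400, 380, 210))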
@Brandon yeah that's what i was wondering, if there was an assist
there is a more accessible ToF sensor, 16x16 pixels, the VL53L1X - may be of interest, though the field of view is, at least without optics, annoyingly narrow.
Yes. So with disparity depth the resolution can be much higher for low price as well.
For example we do 640x480 depth along with 13MP color in a $149 camera.
optical flow for the disparity or is there some more clever stuff going on?
And 1,280x800 (1MP) depth along with 12MP color camera for $199
So the disparity engine is census transform based.
what depth resolution, approx?
Which produces great depth for the power.
1,280 x 800 depth resolution.
i mean in millimeters.
oh, accuracy. So below 3% error (in good conditions)
and for passive stereo that's good lighting and good texture of the surface
And here is the block diagram of how it works. And Erik is right, 3% of distance is about what disparity depth can do.
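That ~3% figure falls out of the disparity-to-depth relation Z = f·B/d: a fixed sub-pixel matching error Δd becomes a depth error of roughly ΔZ ≈ Z²·Δd/(f·B), so the relative error grows with distance. A quick back-of-the-envelope sketch with assumed numbers (≈7.5 cm baseline, ≈450 px focal length, ≈0.2 px matching error; illustrative, not Luxonis specs):

def depth_error_pct(z_m, focal_px=450.0, baseline_m=0.075, disp_err_px=0.2):
    # dZ = Z^2 * d_disparity / (f * B); all defaults are illustrative assumptions
    dz_m = (z_m ** 2) * disp_err_px / (focal_px * baseline_m)
    return 100.0 * dz_m / z_m

for z in (1.0, 2.0, 4.0, 8.0):
    print(f"{z:.0f} m -> ~{depth_error_pct(z):.1f}% relative depth error")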
thank you!
We also have a ToF version coming Q1 next year, 1% of distance error.
thanks for the link
:-)
could be pretty handy for forklifts.
@Thomas Shaddack :)?
for forklift automation
yup, or semiautomation. in the beginning, make sure the driver never runs into something expensive.
https://www.youtube.com/watch?v=7GCIuG0-RqY
that's actually exactly what one of our partners are doing
Here is example of it being used on Forklift as Erik mentioned ^
definitely interested to see how well it performs, waiting on delivery
let me find the video of results
https://photos.app.goo.gl/NvhbMvy8W4tvxLQg8
yeah, if you've got depth buffers available to see what the errors are like, especially on the edges, I'm super interested to see
@Brandon is from Luxonis too
BTW, everyone, these are initial results with ToF
Thanks Dan! (And sorry about late-ish join)
No worries ;-)
https://opencv.org/opencv-ai-competition-2021/
OpenCV AI Competition
More applications here.
@Erik in that video, is the 3D model generated from real time operation from the camera?
Yes.
In that example the RGB alignment and colorized point cloud are generated on-camera.
https://www.youtube.com/watch?v=hEtcsiO-x1M&feature=emb_logo
Here's my favorite application ^
*cough*
Any other questions?
*shuffles feet nervously*
Ha. Well said.
mostly RTFMing here
Heh. Even better said. There's a ton of flexibility. But we make it so you can get up and running in a couple minutes, usually.
let me ask a question to Brandon then: why did you start this company/platform?
WRT getting going.
To Erik's question, thanks!
So the full story is here:
https://discuss.luxonis.com/d/8-it-works-working-prototype-of-commute-guardian
It Works! Working Prototype of Commute Guardian.
Hey guys and gals! So the 'why' of the DepthAI (that satisfyingly rhymes) is we're actually shooting for a final product which we hope will save the lives of people who ride bikes, and help to make bike commuting possible again for many.
But we were working in AR/VR and specifically the perception-in-physical-space bit.
When a bunch of folks in our network were hit by distracted drivers.
1 killed. 3 gravely wounded: 2 of whom will never walk right again, and the 3rd can no longer program (he was an excellent programmer) because of a traumatic brain injury.
I'm not sure if this is an appropriate question, but what do you think the killer app for spatial AI is? Is it already out there, or is it a use case that has yet to be explored?
Great question.
So one of the most important use-cases is life and safety.
As spatial AI can perceive the world like a human can.
So it can be used to make automated safety systems that were the dreams of science fiction - just 10 years ago.
By telling what is going on: whether a person, hand, elbow, whatever, is in a position of danger.