ESP32 stereo camera for object detection, recognition and distance estimation

แชร์
ฝัง
  • เผยแพร่เมื่อ 17 ม.ค. 2023
  • In this video, I show how I made an esp32 stereo camera, which can be used for object detection and distance estimation, with quite good results. The camera consists of two esp32-cams, a perf board, some female header pins, some wires, a battery shield (with 5v output and charger) and a battery. The camera streams into a python notebook, and uses the mask-rcnn model from the pytorch library for segmentation of left and right images. Then it matches the objects from the left image to the objects from the right image and does distance estimation.
    GitHub code:
    github.com/jonathanrandall/es...
    Artificial Intelligence graduate certificate that I am doing:
    www.latrobe.edu.au/courses/gr...
    And graduate diploma:
    www.latrobe.edu.au/courses/gr...
    Daniel Rossi’s ESP32-CAM Python stream OpenCV project:
    www.hackster.io/onedeadmatch/...
    Battery Shield
    www.aliexpress.com/item/32870...
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 122

  • @belalsafy7993
    @belalsafy7993 4 หลายเดือนก่อน +4

    I think this is the best video ever on TH-cam. It contains exactly what I need and is clearly explained. Thank you with all my heart. Please keep me going.

    • @jonathanr4242
      @jonathanr4242  4 หลายเดือนก่อน +1

      Thank you. That is very kind of you.

  • @deoarlo
    @deoarlo 14 วันที่ผ่านมา +1

    Amazing work Jonathan! Esp is hands down my favourite board so thank you for documenting your work!

    • @jonathanr4242
      @jonathanr4242  14 วันที่ผ่านมา

      Thank you, Deo. That is very kind.

  • @zachd4785
    @zachd4785 7 หลายเดือนก่อน +3

    Jon, trying to wrap my head around this process, but must say, it's clear you know what you're talking about! Very well done on this video and the project.

    • @jonathanr4242
      @jonathanr4242  7 หลายเดือนก่อน

      Thank you, for the kind comment.

  • @ArduinoTex
    @ArduinoTex ปีที่แล้ว +2

    Quite an interesting idea for the project. Thank you for the useful information.

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      Thank you. That is very kind of you to say.

  • @avikdas18
    @avikdas18 ปีที่แล้ว

    Amazing tutorial. Thank you so much for the beautiful explaination ❤❤

  • @aditya_01
    @aditya_01 8 หลายเดือนก่อน +1

    Thanks that was reallly helpful

  • @gouledawad2377
    @gouledawad2377 ปีที่แล้ว

    Great content. Thank you for making it easy to understand.

  • @tech_talks_by_marsh
    @tech_talks_by_marsh ปีที่แล้ว +1

    Appreciate your time and good work ..kudos

  • @OMNI_INFINITY
    @OMNI_INFINITY 11 หลายเดือนก่อน +1

    Thanks for posting that and the files! Been wanting a nice stereo camera to use with an HMD, so I guess I may build a small edition of that.

    • @jonathanr4242
      @jonathanr4242  11 หลายเดือนก่อน

      Sounds Awesome! There is so much cool stuff you can do with a stereo camera. I made my own because I had some extra esp32-cams. But, the good thing about getting a proper stereo camera is that you can sync the frames a bit more easily, which is important if you want to do stereo video. I had some difficulties with synchronisation when I tried to use the esp32-cams for stereo video.

    • @OMNI_INFINITY
      @OMNI_INFINITY 11 หลายเดือนก่อน +1

      @@jonathanr4242 Oh? Surprised. With short wires the latency should be short enough to synch the first frame.

    • @jonathanr4242
      @jonathanr4242  11 หลายเดือนก่อน

      @@OMNI_INFINITY I was using wifi with frame buffering. I didn't realise it at the time, but one solution would be to grab the most recent frame in the buffer, and drop the others.

    • @OMNI_INFINITY
      @OMNI_INFINITY 11 หลายเดือนก่อน +1

      @@jonathanr4242 Ah. Can connect rx and tx of those seemingly.

  • @3dnoob
    @3dnoob ปีที่แล้ว

    Wow! This is really great, you explained and simplified a lot of topics in this video. I hope if you can modify your stereo-vision setup to create a structured-light scanner with a projector and OpenCV.

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      Thank you. There are a number of projects, tweaks and modifications I'd like to make. The structured-light scanner is a very good idea.

    • @R00kTruth
      @R00kTruth 6 หลายเดือนก่อน

      I wish @@jonathanr4242 that I had came across your channel/video before I bought the kinectv2 adapter!!!

  • @maheshpatel2005
    @maheshpatel2005 ปีที่แล้ว +1

    Nicely explained

  • @rithikroshan7252
    @rithikroshan7252 7 หลายเดือนก่อน +1

    amazing video!!!

  • @sermadreda399
    @sermadreda399 ปีที่แล้ว

    Great video, thank you for sharing

  • @dhiyaulhaqfairaazjamil3134
    @dhiyaulhaqfairaazjamil3134 6 หลายเดือนก่อน

    Hello sir, Can it be used to detect 3-dimensional geometry shapes?

  • @collinb5524
    @collinb5524 ปีที่แล้ว +1

    Very interesting stuff. Im currently learning openCV from Paul Mcwhorter. Please keep posting

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว +1

      Thank you, Collin. Yes, openCV is powerful stuff. I am working on another project with the stereo camera which I hope to post in the next couple of weeks.

  • @chenbenzvi2164
    @chenbenzvi2164 5 หลายเดือนก่อน

    First, this video is amazing!
    Deployment of the model and communication with the esp32 are quite straight forward.
    The core of the project is the tracking and calculating the distance which I felt you kind of breezed over. I wish you could do an in depth video just on that part. A great video anyway though . Keep it up
    After a second look - from my understanding there is a dangerous state where the object is close to the cameras. It will calculate the distance from the opposite corners of the object which will return...something that is probably not correct. Good luck! (The object should be symmetrical like a screen so the model predicts it as the same class)

    • @jonathanr4242
      @jonathanr4242  5 หลายเดือนก่อน

      Hi. Thanks for your comment. Yes the project needs a bit of tweaking. There are probably a fair few heuristics one can employ. In fact, every AI project will have some tweaks applied that will greatly improve them.
      Also, I want to say that I discovered in another project that when I bring in the image from the esp32 cam, I'm taking it from a queue, so that I don't get the most recent image, I can get a bit of a delay. To fix this, I found this wrapper for the image capture stuff which avoids this bug. I've used the wrapper in another video with the time of flight camera. The github repo should be referenced in the description of that project. So basically, it uses a thread to keep reading eliminating frames in the queue so that when you want to grab a frame, it grabs the most recent, which is essential if you have moving objects.

  • @sidharthpisharody
    @sidharthpisharody 7 หลายเดือนก่อน +1

    When you replace d3 with dc, what should be the pixel count p. Will it remain the same as the number of pixels of object or the pixels coming within dc distance?

    • @jonathanr4242
      @jonathanr4242  7 หลายเดือนก่อน

      The pixels making up distance dc. I'm trying to replace all distance measurements with pixel counts, so that it makes sense in digital image processing.

  • @JamesNewton
    @JamesNewton ปีที่แล้ว +2

    Have you heard of the old laser line trick for doing distance measurements? Super cheap, super fast, one camera and a (you guessed it) laser line. This is nice, because it tells you want the object is, which the laser line won't do. Nice work!

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      Thanks James. I will look at the laser line trick. I am planning to add a LIDAR to the esp32 camera at some stage. Hopefully, this will happen in the near future sometime.

    • @JamesNewton
      @JamesNewton ปีที่แล้ว +3

      @@jonathanr4242 LiDAR are the cool toy everyone wants to play with. And they are wonderful, not doubt. A little spendy, but I'm seeing that the prices are getting down into the below $100 range? Hmmm... looks like a lot of those are NOT really LiDAR. LOL. The real problem with LiDAR and most 3D cameras is that they bury you in data. In the same way that the NN takes several seconds to lock in on the objects in this project, sorting through the LiDAR data and making use of it takes time. The laser line thing is nearly instant. It's a great first shot at ranging, which then helps things like image recognition because instead of handing the NN an entire frame, you can hand it just the one area where the object is known to be. That really speeds up recognition. And the total cost is like $5... or less actually, you can get cheap modules for a $1 from china.

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      Thank you, James. I will definitely look into it. Is the module you're referring to a time of flight sensor? Do you have any references for projects that have done this?

    • @JamesNewton
      @JamesNewton ปีที่แล้ว +4

      @@jonathanr4242 I did it multiple times back in the 80's and it's been done many times since then but usually in slightly different form. It's nothing more than a laser line connected to an io pin to turn it on and off, and a camera. You capture a frame with the laser line off, then turn it on and capture another frame. Subtract the two frames, and the only thing left will be any reflections of the laser line against objects in front of the camera. Because of parallax between the line and the camera (the line is placed a bit below the camera), the distance from the bottom of the picture, up to the pixel where the line shows up, will be the range to the object. To clarify, the image subtraction is pixel by pixel. The first pixel (upper left corner) of the first image has a value (or values in RGB) and you subtract that value from the first pixel of the second image, making a new image with a pixel value which is the difference. One common use is with a turntable to make a low cost 3D scanner. like this from my site:
      techref.massmind.org/techref/new/letter/news0310.htm
      I'm suggesting that you don't use the turntable and you turn the laser and camera on their side. The laser projects out a plane under the camera, and only detects things that are resting on the surface. But then... what doesn't rest on the surface?

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว +1

      @@JamesNewton Thanks James! That sounds awesome. It gives me so many ideas. I had a project where I tried to use a laser to guide an electric water gun to the target. But I couldn't get this to work because the red light from the laser saturated the camera (showed up as all white), so it didn't work very well. I think you've just told me how I can solve this.

  • @tusheyy6033
    @tusheyy6033 หลายเดือนก่อน +1

    hello! the feed of the camera appears to be slow when object recognition and distance estimation are running, how do i speed it up?

    • @jonathanr4242
      @jonathanr4242  หลายเดือนก่อน

      Thanks for your message. There are a couple of things you can do.
      1. If you want to speed up the object recognition part, you can use YOLOv5 or YOLOv8 instead of r-cnn I am using. This is much faster. Also running it on a GPU will make it faster. I have implemented this in the next video. The code is on this github repo: github.com/jonathanrandall/autonomous_task_stereo_cam
      2. Secondly, if you just want to speed up the video capture part, you can try running on a separate thread. See the solution to this question on stackoverflow: stackoverflow.com/questions/43665208/how-to-get-the-latest-frame-from-capture-device-camera-in-opencv
      You might want to try moving the imshow inside the videocapture class if you implement the above thread.

  • @mohammedrhaiouz2668
    @mohammedrhaiouz2668 หลายเดือนก่อน +1

    Hi i just wanna ask you if i can use this camera to detect only white ball and estimate the ball'trajectory?? W8ing for ur answer please 🙏🏻

    • @jonathanr4242
      @jonathanr4242  หลายเดือนก่อน +1

      Thanks for your question. You’d be better off looking at key point detection in something like yolov8 or v9.

  • @utkucicek6664
    @utkucicek6664 20 วันที่ผ่านมา +1

    Do you think it would be possible to do slam with the stereo cameras? My only worry is the processing power. Thanks for sharing this tutorial!

    • @jonathanr4242
      @jonathanr4242  20 วันที่ผ่านมา +1

      Yes. 100%. There is a great clip of Elon Musk with Andrej Karpathy saying lidar is pointless for cars. But you’re right about the processing power, especially for a small hobby robot. As well, there are still challenges that need to be addressed, like visual understanding.

  • @kevindarren756
    @kevindarren756 3 หลายเดือนก่อน +1

    Amazing video, just got some minor questions:
    - Is it a fair assumption to make that the focal length of this ov2640 is the same for all the of them from different esp32cams?
    - Have you worked on calibrating the sensor to get the projection matrix?
    Regards,
    Kevin

    • @jonathanr4242
      @jonathanr4242  3 หลายเดือนก่อน

      Hi Kevin. Thanks for your comment. In my experience, ov2640 from different sellers (buying from ali express) can have varying degrees of quality. So, I would say you need to calibrate every one of them. I haven't really spent much time on calibration. I usually just plug in the sensor and hope for the best.

  • @Mohammedalhalabi16
    @Mohammedalhalabi16 ปีที่แล้ว +1

    Question: If you disconnect the internet from it, will it still work on tracking moving objects without its connection to the internet, or should it be connected to the internet in order to continue tracking the movement of objects, and if it works, can we make a project to track an object using Arduino Uno

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      You can run it on a local network, but it needs a computer that can run open cv and pytorch.

  • @MunavirZamanPK
    @MunavirZamanPK 24 วันที่ผ่านมา

    thank you for the video
    how to upload code bez there is no port?
    is there any hardware circuit diagram

    • @jonathanr4242
      @jonathanr4242  23 วันที่ผ่านมา

      you need to use an FTDI to connect the ai thinker to a usb. You can get these from Amazon or aliexpress. There are other esp32 cams which have a usb conneciton, like the XAIO esp32 cam. They're generally a bit more expensive, but worth it.

  • @ansjaved3484
    @ansjaved3484 ปีที่แล้ว +2

    Hi, I appreciate your work. You have done an excellent tutorial. I would like to ask about the system requirements to run the MaskRCNN_ResNet50_FPN_V2. Can you tell me about it? As I'm doing the same work but running on jetson nano. I'm not able to find a quite fast object detection model with good accuracy. I will appreciate your help. Thanks!

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว +1

      Hi Ans. Thank you for your kind words. I am using an old-ish lenovo yoga slim laptop. It takes around 5 seconds per image. In my next video I use YOLOV8 (model m), which is about 10 times faster. I haven't tried on a jetson nano, but I would like to. Please let me know how it goes.

  • @darthvader4899
    @darthvader4899 5 วันที่ผ่านมา +1

    great amazing video! another great video buried under you tube cemetery

  • @junyang1710
    @junyang1710 6 หลายเดือนก่อน +1

    Can you do a project, which uses only one camera and odometry to detect if the distance between two cars is enough for my car to side parking? Sometimes it is very hard to park if the distance is not enough.

    • @jonathanr4242
      @jonathanr4242  6 หลายเดือนก่อน

      Hi. I think it would be possible, but would require decent labeled data for training.

  • @arnabbarua1670
    @arnabbarua1670 11 หลายเดือนก่อน +1

    Amazing video and well explained
    Do you have any video, on how we could implement stereovision on human flow with esp32cams in realtime?

    • @jonathanr4242
      @jonathanr4242  11 หลายเดือนก่อน +1

      Thank you, Arnab. I'm working on something at the moment with a depth camera that might help. Hoping to have it out by the end of this month or middle next month at the latest. R-CNN is a bit slow, might be better to use YOLO if you want to do it in real time. In the video after this (esp32cam for autonomous task completion) , I do an implementation with YOLO, which is much faster than the R-CNN implementation.

    • @arnabbarua1670
      @arnabbarua1670 11 หลายเดือนก่อน +1

      @@jonathanr4242 Yes, I have been seeing you are doing a lot of work with esp32 cams and implementing it with various case. I have seen the video on your yolov8 implementation.
      I have one concern though, As i am working on people counting, i might use it at night too. In that case, can stereovision help with IR footage?

    • @jonathanr4242
      @jonathanr4242  11 หลายเดือนก่อน +1

      @@arnabbarua1670 I don't see there would be an issue with using two IR cameras, as long as you can get a decent image from each camera and do feature mapping. As well you can estimate the size of the person, so can estimate how far a way they are from a single camera, which might help a bit. I mean it might be an advantage if you have both cameras with IR LEDs because you'd have more IR light, but on the other hand there might be too much IR light (I'd say the more the better, given those IR LEDs are not very bright). Sounds very interesting. I was looking at one of those the other day for raspberry pi, but it was a bit more expensive than I wanted to spend.

  • @jacekf
    @jacekf 8 หลายเดือนก่อน +1

    Binocular camera product introduction:
    Based on the OV2640 camera, the binocular camera switches display, the pin is compatible with the standard 24P camera, and can be directly connected through the 24P FPC cable.
    The binocular distance is 60mm, which is close to the real body distance of the human body. It can be used for deep visual research. MaixPy provides a binocular operation interface.

  • @Moon-D0G
    @Moon-D0G 2 หลายเดือนก่อน +1

    Thanks for the great video. Will this process be faster with my own gpu ? I'm thinking to implement this to my school project. Camera will track the object, i thought it will be more precise but if it's thaat slow 😢

    • @jonathanr4242
      @jonathanr4242  2 หลายเดือนก่อน

      It will be much faster with a gpu. Also YOLOv8 will be faster than mask r-cnn.

  • @mohamedmohsen1859
    @mohamedmohsen1859 ปีที่แล้ว +1

    Hello, thanks for this great video, I tried using your code with different images it gives me wrong distances, it only works with the image in your repo

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      Did you calibrate your camera?

  • @welsonfy5246
    @welsonfy5246 10 หลายเดือนก่อน +1

    Hello, good job! with stereo cameras where can we find the width, height, fx, fy, cx, cy values ​​from a photo taken or a video? Thanks in advance

    • @jonathanr4242
      @jonathanr4242  10 หลายเดือนก่อน +1

      Thanks. Most of the calibration information should be in the python notebooks on the github.

    • @jonathanr4242
      @jonathanr4242  10 หลายเดือนก่อน +1

      Also, I should probably mention that my approach is fairly simple. I'm only using the distance of the object from the camera, and the distance apart of the two cameras. My assumption is that both cameras have the same focal length.

    • @welsonfy5246
      @welsonfy5246 9 หลายเดือนก่อน

      Thank you but the parameters when taking the photo or video which concerns the values ​​width, height, fx, fy, cx, cy that I quoted how are they found?

    • @jonathanr4242
      @jonathanr4242  9 หลายเดือนก่อน

      @@welsonfy5246 I think my approach is a little different to what you find in the text book. I have two parameters, which are tan(theta) and focal length. And then, I take the corner point of the object as the matching point in each image to calculate the distance. I go through the calculations in the video, and they should be included in my code. There is room for improvement. For example, one improvement could be once you find the matching objects you can correct for parallax.

  • @amalcz6066
    @amalcz6066 2 หลายเดือนก่อน +1

    hi, currently i doing distance estimation project from your video. i made the set up exactly like yours. esp32 cam kept at 7.05 cm away. do i need to calibrate my camera, or can i use the calculated value from the video for focal length and tantheta, since we are using the same setup? what are the values should i change in the code for tunning it, apart from focal length and tantheta? I have to submit the project soon😢😂

    • @jonathanr4242
      @jonathanr4242  2 หลายเดือนก่อน

      You need to calibrate the camera. Basically, you need to measure the same object at a few different distances and adjust the focal length and tantheta.

  • @59vijaiyaaravindthsr39
    @59vijaiyaaravindthsr39 9 หลายเดือนก่อน +1

    what the maximum range it can deduct the distance of an object?

    • @jonathanr4242
      @jonathanr4242  9 หลายเดือนก่อน

      Thanks for your thought inspiring comment. Using 640x480 resolution, I think I was able to accurately measure up to around 120cm (if I remember correctly ?). As the resolution increases, so does the maximum distance. You can calculate the theoretical maximum by taking the left/right distance apart as 1 pixel. But I would probably divide this by at least three, taking into account practical considerations. But then I guess the distance resolution would also decrease as the distance away increased.

  • @Sumathy.v
    @Sumathy.v 11 หลายเดือนก่อน +1

    I am new to embedded system,,can we suggest me where can i learn esp32 programing(not using adruino ide)

    • @jonathanr4242
      @jonathanr4242  11 หลายเดือนก่อน

      My go to pla es are dronebot workshop, random nerd tutorials and Paul mcwhorter from toptechboy.com, the best way to learn is to do projects.

    • @Sumathy.v
      @Sumathy.v 11 หลายเดือนก่อน

      @@jonathanr4242 hey man ...thanks for the reply u gain my respect as a teacher 🛐

  • @m7mold128
    @m7mold128 7 หลายเดือนก่อน +1

    can you do a tutorial on connecting the hardware

    • @jonathanr4242
      @jonathanr4242  7 หลายเดือนก่อน

      Thanks for your comment. It's actually very simple to connect the hardware. All I do is make sure the two esp32 cams are on a grid, so that they are fixed in position, and then upload the sketch to them. In all honesty, the kind of stereo cam I've used is a bit messy. I would probably recommend using a proper usb type stereo camera.

  • @shreshthjha997
    @shreshthjha997 ปีที่แล้ว +1

    Hello sir.. i am doing a project exactly same as you have done in the video previously i was using 1 esp32 camera and everytime i got wrong distance of the object..sir can you help if i use 2 esp32 can i do it like you

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      You need to use two esp32.

    • @shreshthjha997
      @shreshthjha997 ปีที่แล้ว

      ​@@jonathanr4242 Sir, Should I train model for object detection? I had used two esp32 like you but it was not detecting any object like person etc. Can you please help?

  • @waltherhumberto271
    @waltherhumberto271 2 หลายเดือนก่อน +1

    Can I create a model on YOLO8 to find especifc mouses , can I simple deploit in the program ?

    • @jonathanr4242
      @jonathanr4242  2 หลายเดือนก่อน

      Yes. The only issue is you will need lots of training data.

    • @waltherhumberto271
      @waltherhumberto271 3 วันที่ผ่านมา

      @@jonathanr4242 thank you, I did and now is working perfect, one thing yet is that ,I saw your video many times, and I still didn't get the ideia behind the focus legnth calculation, could you write a resume here please ?

    • @waltherhumberto271
      @waltherhumberto271 3 วันที่ผ่านมา

      or give any paper ?

    • @waltherhumberto271
      @waltherhumberto271 3 วันที่ผ่านมา

      @@jonathanr4242 I didn't get how you got the formula f = 30-38*50/68

    • @jonathanr4242
      @jonathanr4242  วันที่ผ่านมา

      @@waltherhumberto271 this is the formula at the bottom right at 15:38. I’ve placed the object 30cms away for calibration and measured the other values manually.

  • @jacekf
    @jacekf 8 หลายเดือนก่อน +1

    1pcs Sipeed OV2640 Binocular Camera Development Board Stereo Vision Depth Vision

  • @lancelotnub
    @lancelotnub 9 หลายเดือนก่อน

    sir, how to solve it ?
    Traceback (most recent call last):
    File "C:/Users/ASUS/OneDrive/Documents/test.py", line 59, in
    faces = face_classifier.detectMultiScale(gray)
    cv2.error: OpenCV(4.5.3) C:\Users
    unneradmin\AppData\Local\Temp\pip-req-build-uzca7qz1\opencv\modules\objdetect\src\cascadedetect.cpp:1689: error: (-215:Assertion failed) !empty() in function 'cv::CascadeClassifier::detectMultiScale'

    • @jonathanr4242
      @jonathanr4242  9 หลายเดือนก่อน

      It's difficult to tell without know what part of the code is causing the exception. Usually, I would try to isolate the error.

  • @a7medrafat31
    @a7medrafat31 ปีที่แล้ว +1

    I have a problem with the weights file i dwonloaded it but still got the same problem

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      what is the error that you are getting?

    • @a7medrafat31
      @a7medrafat31 ปีที่แล้ว +1

      @@jonathanr4242 pytorchstream readerfailed reading zip archive failed finding central directory

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      @@a7medrafat31 Oh! I've never seen that error. In the next video, I do the stereo pair matching with yolov8. I've put the notebooks on the github for those as well. It's much faster than pytorch, but not as reliable. And you can tweak it, I suppose. The git repository is called autonomous_task_stereo_cam

    • @a7medrafat31
      @a7medrafat31 ปีที่แล้ว

      @@jonathanr4242 i resolved the error but i want to know how to make inference continously ?

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      @@a7medrafat31 You will probably need a stereo camera that can synchronise the left and right images. If you're using the esp32-cam, there will be a lag between left and right images.
      Also, you will most likely need to implement the object segmentation and detection on a GPU to get it to keep up with the frame rate.

  • @AbangPoi
    @AbangPoi 8 หลายเดือนก่อน

    Hello, sir. Thank you for your content which helps me to complete my assignments at school. I'm from Indonesia, I'm confused about how to take an image from the file name so that it displays two different images, for example the left image is 50cm and the right image is 50cm, can you help me sir?

    • @jonathanr4242
      @jonathanr4242  8 หลายเดือนก่อน

      Hi. I’m not sure I understand the question.

  • @HilalGezgin
    @HilalGezgin ปีที่แล้ว +1

    Is it possible to do this with a single camera?

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      I don’t think so. It’s specifically designed to be used with two cameras.

    • @EdFrench_uk
      @EdFrench_uk 11 หลายเดือนก่อน

      If you know the target is stationary you can move the camera to get multiple shots to do the trigonometry

  • @truetech4158
    @truetech4158 10 หลายเดือนก่อน +1

    It would be nice to have stereoscopic cams linked to a VR eyewear setup, and the cams mounted on the face of a droid robot that walks around, realism for when you are too lazy to go to the mall, but also want to your robot to provide for realism as if you were at the mall.
    Stereoscopic hearing, sight, even stereo nostril sensors to detect from where exactly smells are occuring from.
    This could detect covid in a crowd this way if anyone there farts and expells a observable testable dangerous toxic invisible cloud, a life saving alert can be given to people preemptively.

    • @jonathanr4242
      @jonathanr4242  10 หลายเดือนก่อน

      That sounds very cool, indeed. It would be amazing to build something like that.

  • @hamid9083
    @hamid9083 8 หลายเดือนก่อน

    Hi, thank you for this great video. I learned a lot from you just in this one video. But I think you made a mistake in one part of the code to calculate the focal length.
    you use this formula: f1=30-38.44*50/68.75
    I think you must use this: f1=(30-38.44*50/68.75)/(1-38.44/68.75)
    I also have a question. Can we use this distance to get the horizontal and vertical distance of the object to the camera?

    • @jonathanr4242
      @jonathanr4242  8 หลายเดือนก่อน

      Hi. Thanks for your comment. Yes. It’s not difficult to measure height and width. You can do it from the geometry by taking the ratio of pixels across to the pixel width.

  • @sdpatsdp
    @sdpatsdp 3 หลายเดือนก่อน +1

    I’m interested in stereo vision how can we get in touch?
    Regards.

    • @jonathanr4242
      @jonathanr4242  3 หลายเดือนก่อน

      Thanks for your comment. You can ask any questions here or email me.

    • @sdpatsdp
      @sdpatsdp 3 หลายเดือนก่อน

      @@jonathanr4242 ciao. YT is non publishing my answer. What is your email address?

  • @mohamedabdelmalekbouchrika6134
    @mohamedabdelmalekbouchrika6134 8 หลายเดือนก่อน +1

    can it be in realtime detection?

    • @jonathanr4242
      @jonathanr4242  8 หลายเดือนก่อน

      Yes. You would need to run it on a gpu, and also YOLO would be a bit faster.

  • @goldcoasttime
    @goldcoasttime ปีที่แล้ว +1

    johnathon where do you live are you interested in colaborating on a project i'm on the gold ccoast

    • @jonathanr4242
      @jonathanr4242  ปีที่แล้ว

      Thanks for you comment. I am quite far from the gold coast. I am in Sydney. Happy, if you have any ideas you want to bounce off me.

  • @batuhanbatuhan4131
    @batuhanbatuhan4131 ปีที่แล้ว

    I got offended by your productivity.