Difference between revisions of "Object Movement Prediction for Autonomous Cars"

From ISLAB/CAISR
(Created page with "{{StudentProjectTemplate |Summary=Predicting the movement of objects in the context of autonomous cars |References=https://motchallenge.net https://github.com/abhineet123/Dee...")
 
m
Line 7: Line 7:
 
|Status=Open
 
|Status=Open
 
}}
 
}}
Right now we have architectures like Yolo that are very good at predicting bounding boxes for objects.  
+
Nowadays, we have several powerful architectures, e.g. YOLO, that allows us to find bounding boxes on the fly.  
But what if we wanted to use that abstraction to try predict the next time-step position of the objects in it? The idea behind this project would be to try to create a model that could, given the n previous frames, predict the following one.
+
 
 +
 
 +
Single-object tracking focus on the processing of sequences of RGB images to be able to identify and track a given object, which can be costly in terms of memory/computation. The main idea being this project is to use the bounding boxes itself and try to predict its movement based on the n-previous frames. By using this higher-level abstraction of the scene itself we might reduce the complexity and training time required for traditional Single-Object tracking.
 +
 
 +
To start, we can use Kitti dataset to create such a prediction system and exploit other possible datasets/possible settings as soon as we have a working prototype.

Revision as of 15:38, 3 October 2019

Title Object Movement Prediction for Autonomous Cars
Summary Predicting the movement of objects in the context of autonomous cars
Keywords
TimeFrame
References https://motchallenge.net

https://github.com/abhineet123/Deep-Learning-for-Tracking-and-Detection

Prerequisites
Author
Supervisor Tiago Cortinhal
Level Master
Status Open


Nowadays, we have several powerful architectures, e.g. YOLO, that allows us to find bounding boxes on the fly.


Single-object tracking focus on the processing of sequences of RGB images to be able to identify and track a given object, which can be costly in terms of memory/computation. The main idea being this project is to use the bounding boxes itself and try to predict its movement based on the n-previous frames. By using this higher-level abstraction of the scene itself we might reduce the complexity and training time required for traditional Single-Object tracking.

To start, we can use Kitti dataset to create such a prediction system and exploit other possible datasets/possible settings as soon as we have a working prototype.