How I got my MS Thesis Idea

After Cross View Retrieval, while working on Cyclist Detection, I came across some functions as part of scripts provided by the authors of the Tsinghua-Daimler Cyclist Detection Dataset.

As I wasn’t able to cater to small objects to improve cyclist detection, I thought about indirectly predicting camera parameters while directly converting 2D points to 3D by embedding the above equations in a CNN with the help of Lambda layers.

But first I had to see whether what I was thinking had already been done or not. For this, I performed a literature review.

Overview of configurations for different aspects of Camera Calibration can be summarized as a table.

So after the literature review, I knew that I can go ahead with my idea which is obviously what I did. First thing was to represent mathematical equations in the form of lambda layers.

I named the lambda layer representation as Camera Projection Loss.

After designing the network, we came across another problem which was the unavailability of enough camera configurations with all the required parameters which were 13 in our case. So we decided to generate our own dataset using CARLA. We were able to generate 50 camera configurations using 2 towns having 25 configurations each but one episode was removed by mistake so the remaining 49 episodes were used. For the real dataset, we used the Tsinghua-Daimler Cyclist Detection Dataset.

Our method performed better than other methods on 7 out of 10 parameters on both synthetic and real data.

So that’s it for now, See you later.

References

--

--

--

MS Thesis Student, CVGL, LUMS http://pk.linkedin.com/in/talhahanifbutt

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Slime vs Me, Myself, & I

Using CNN to build a sneaker authenticator: Image Preprocessing and Modeling (3/3)

Unraveling Automatic Differentiation

Using image data, predict the gender and age range of an individual in Python.

Building a Job Recommender for Non-Technical Business Roles via NLP and Machine Learning

Vision Recognition with Clarifai

A survey on methods and applications of meta-learning with GNNs

Build and deploy a Flask ML Web Application

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Talha Hanif Butt

Talha Hanif Butt

MS Thesis Student, CVGL, LUMS http://pk.linkedin.com/in/talhahanifbutt

More from Medium

Jumping Trees

Running for Office in 2023

My experience of Summer Of Bitcoin’21

California Scholarship Federation: A Scholarship for Every Student