Ishan P
- Research Program Mentor

MS at Stanford University

Expertise

Computer Science, Machine Learning and AI (Computer Vision, Robotics, Large Language Models, Generative AI)

Bio

Hi! I currently work as a Machine Learning Engineer on Apple's AI/ML team. Previously, I was a Deep Learning Engineer at Focal Systems, a retail AI startup working to automate the grocery store by predicting product inventory on shelves with cutting-edge computer vision neural networks. Before that, I was a Software Engineer at Amazon, where I was one of the founding members of the Robotics AI team that launched the Amazon Astro home robot. I completed my MS at Stanford University, where I carried out ML research with the Stanford Vision Lab and the Stanford School of Medicine, and I received my B.Tech from the Indian Institute of Technology, Indore. Outside of work and research, I love spending time outdoors: road biking, running, playing tennis and, more recently, skiing!

Project ideas

Project ideas are meant to help inspire students' thinking about their own projects. Students are in the driver's seat of their research and are free to use any or none of the ideas shared by their mentors.

Predicting dance genre from street dance videos

In this project, we will build a machine learning model to identify the genre of street dance (jazz, hip-hop, break, etc.) from input demonstration videos. Starting with the AIST Dance Video Database (https://aistdancedb.ongaaccel.jp/), we will first use a library like OpenCV to extract frames from each video, converting the video data into a format suitable for training a neural network. We will then define the model architecture using a popular library like PyTorch. This could be a deep learning model that needs to understand both the semantic content in each frame (CNN) and the temporal context across all the frames (RNN/LSTM) to predict the genre. We will then evaluate this model on test data held out from the same dataset and look into ways to improve it. Finally, we can use this model to classify other street dance videos not seen during training. That could even be your own street dance video!
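As a rough illustration of the frame extraction step, the sketch below picks a fixed number of evenly spaced frame indices from a video of known length, so that every video yields an input sequence of the same size. The function name `sample_frame_indices` and the midpoint sampling strategy are assumptions made for this example, not part of the project description; the resulting indices would then be handed to a video reader such as OpenCV to load the actual frames.

```python
from typing import List


def sample_frame_indices(total_frames: int, num_samples: int) -> List[int]:
    """Pick up to num_samples frame indices spread evenly across a video.

    Hypothetical helper: the video is split into num_samples equal-width
    segments and the frame at the middle of each segment is chosen.
    """
    if total_frames <= 0 or num_samples <= 0:
        return []
    if num_samples >= total_frames:
        # Short video: keep every frame rather than repeating any.
        return list(range(total_frames))
    step = total_frames / num_samples
    return [int(step * i + step / 2) for i in range(num_samples)]


if __name__ == "__main__":
    # Example: a 100-frame clip reduced to 4 representative frames.
    print(sample_frame_indices(100, 4))
```

Sampling a fixed number of frames keeps the sequence length constant across videos, which makes batching for a CNN plus RNN/LSTM model simpler. Other choices, such as random sampling within each segment, are equally reasonable.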

Coding skills

Python, C++, Matlab, Bash, SQL

Languages I know

Hindi, advanced; Marathi, native

Teaching experience

I have mentored undergraduate and graduate students several times as a teaching assistant for Stanford's on-campus introductory Machine Learning and Deep Learning classes. This involved helping them with their course assignments and guiding them through course projects. I have also mentored undergraduate students at the Stanford Office of Accessible Education.

Credentials

Work experience

Apple Inc (2022 - Current)

Senior Machine Learning Engineer

Focal Systems (2021 - 2022)

Deep Learning Engineer

Amazon Inc. (2018 - 2021)

Software Engineer

Nvidia (2017)

Deep Learning Intern

Education

Indian Institute of Technology, Indore

BTech Bachelor of Technology (2016)

Electrical Engineering

Stanford University

MS Master of Science (2018)