Applications Open now for January 2025 Batch | Applications Close: January 02, 2025 | Exam: February 23, 2025

Applications Open now for January 2025 Batch | Applications Close: January 02, 2025 | Exam: February 23, 2025

Degree Level Course

Deep Learning Practice

-Recognise the full stack of deep learning - datasets, frameworks, hardware for training, deployment across devices, interpretability, and security -Use tools to improve deep learning practice throughout the entire stack -Apply best practices in training and deployment, even under constraints of data and hardware -Build confidence of training models of real-world scale -Identify problems of social relevance that are solvable with deep learning

by Prof. Mitesh M.Khapra , Prof. S. Umesh , Dr. Kaushik Mitra

Course ID: BSDA5013

Course Credits: 4

Course Type: Elective

Pre-requisites: BSCS3004 -  Deep Learning

Course structure & Assessments

For details of standard course structure and assessments, visit Academics page.

WEEK 1 Introduction to Modern NLP and HF
WEEK 2 Tokenization
WEEK 3 Fine-tuning models for downstream tasks
WEEK 4 Continual Pre-training and Instruction Tuning
WEEK 5 Spoken Language Identification
WEEK 6 Speaker Diarisation (identifying speakers and when they are spoken in a conversation)
WEEK 7 Speech to Text and Text to Speech synthesis
WEEK 8 Wake word detection like "Hey Google" or "Alexa" with personalization
WEEK 9 Image Classification
WEEK 10 Object detection
WEEK 11 Image based depth estimation
WEEK 12 Image super-resolution
+ Show all weeks

About the Instructors

Prof. Mitesh M.Khapra
Associate Professor, Department of Computer Science and Engineering, IIT Madras

Mitesh M. Khapra is an Associate Professor in the Department of Computer Science and Engineering at IIT Madras and is affiliated with the Robert Bosch Centre for Data Science and AI. He is also a co-founder of One Fourth Labs, a startup whose mission is to design and deliver affordable hands-on courses on AI and related topics. He is also a co-founder of AI4Bharat, a voluntary community with an aim to provide AI-based solutions to India-specific problems. His research interests span the areas of Deep Learning, Multimodal Multilingual Processing, Natural Language Generation, Dialog systems, Question Answering and Indic Language Processing. Prior to IIT Madras, he was a Researcher at IBM Research India for four and a half years, where he worked on several interesting problems in the areas of Statistical Machine Translation, Cross Language Learning, Multimodal Learning, Argument Mining and Deep Learning. Prior to IBM, he completed his PhD and M.Tech from IIT Bombay in Jan 2012 and July 2008 respectively.During his PhD he was a recipient of the IBM PhD Fellowship (2011) and the Microsoft Rising Star Award (2011). He is also a recipient of the Google Faculty Research Award (2018), the IITM Young Faculty Recognition Award (2019) and the Prof. B. Yegnanarayana Award for Excellence in Research and Teaching (2020).

  less

Other courses by the same instructor: BSCS3004 - Deep Learning and BSCS5001 - Large Language Models

Prof. S. Umesh
Professor, Department of Electrical Engineering, Indian Institute of Technology, IIT Madras

S. Umesh is a  Professor of Electrical Engineering at IIT-Madras. He completed his PhD from the University of Rhode Island,USA and his PostDoctoral Fellowship from the City University of New York. He has also been a visiting researcher at AT&T Research Laboratories, USA; at Machine Intelligence Laboratory Cambridge University Engineering Department, UK and the Department of Computer Science, RWTH-Aachen, Germany.

...  more

He is a recipient of the AICTE Career Award for Young Teachers in 1997 and the Alexander von Humboldt Research Fellowship in 2004.  During his stint at Cambridge University in 2004, he was part of the U.S. DARPA's Effective, Affordable Reusable Speech-to-text (EARS) programme. Similarly in 2005 he was part of the RWTH-Aachen's TC-STAR project for transcription of speech from European Parliament's Plenary Sessions. Between 2010-2016, he led a multi-institution consortium to develop ASR systems in Indian languages in the agriculture domain which was funded by MeiTY. He is currently leading the ASR efforts for the Natural Language Translation Mission managed by the Office of Principal Scientific Adviser of Govt. of India.

  less

Other courses by the same instructor: BSEE4001 - Speech Technology

Dr. Kaushik Mitra
Assistant Professor, Department of Electrical Engineering, IIT Madras

Dr. Kaushik Mitra leads the Computational Imaging lab at Indian Institute of Technology (IIT) Madras. He is an Assistant Professor in the department of Electrical Engineering, IITM. Before joining IITM, he was a postdoctoral research associate with Ashok Veeraraghavan at ECE department of Rice University. Here, he worked on framework for analysis of Computational Imaging (CI) systems. Prior to that, he earned his Ph.D in Electrical and Computer Engineering from the University of Maryland, College Park, where his research focus was on the development of statistical models and optimization algorithms for computer vision problems. Currently, his research interests are in computational imaging, computer vision and machine learning.

  less