CS551: Introduction to Deep Learning

Writing /home/fac/arijit/public_html/dokuwiki/data/cache/8/829516182c0be3377e262b242b6da3ce.xhtml failed

This course will provide a basic understanding of deep learning and how to solve problems from varied domains. Open source tools will be used to demonstrate different applications.

Monday - 1600-1700
Tuesday - 1700-1800
Wednesday - 1800-1700
All classes will be held in MS Teams.

Jyoti Kumari
Sandeep Patel
Divya Singh
Surbhi Raj
Fazail Amin

Brief introduction of big data problem. Overview of linear algebra, probability, numerical computation. Basics of Machine learning/Feature engineering. Neural network. Tutorial for Tools. Deep learning network - Shallow vs Deep network, Deep feedforward network, Gradient based learning - Cost function, soft max, sigmoid function, Hidden unit - ReLU, Logistic sigmoid, hyperbolic tangent Architecture design, SGD, Unsupervised learning - Deep Belief Network, Deep Boltzmann Machine, Factor analysis, Autoencoders. Regularization. Optimization for training deep model. Advanced topics - Convolutional Neural Network, Recurrent Neural Network/ Sequence modeling, LSTM, Reinforcement learning. Practical applications – Vision, speech, NLP, etc.

Ian Goodfellow, Yoshua Bengio and Aaron Courville, “Deep Learning”, Book in preparation for MIT Press, 2016. (available online)
Jerome H. Friedman, Robert Tibshirani, and Trevor Hastie, “The elements of statistical learning”, Springer Series in Statistics, 2009.
Charu C Aggarwal, “Neural Networks and Deep Learning”, Springer.

Topics	Slides	Annotated Slides
Introduction	pdf	pdf
Linear algebra	pdf	pdf
Feature engineering	pdf	pdf
Neural Networks	pdf	pdf
Deep feed forward networks	pdf	pdf
Introduction to Keras	pdf	NA
Back propagation	pdf	pdf
Regularization	pdf	pdf
Optimization	pdf	pdf
CNN	pdf	pdf
RNN	pdf	pdf
Practical methodologies	pdf	pdf
DRL	pdf	pdf

CS551: Introduction to Deep Learning

Class timings

TAs

Syllabus

Books

Slides