CS551: Introduction to Deep Learning

This course provides a basic understanding of deep learning and of how to apply it to problems from varied domains. Open-source tools will be used to demonstrate different applications.

  • Monday - 1700-1800
  • Thursday - 1500-1600
  • Friday - 1600-1700

Brief introduction to the big data problem. Overview of linear algebra, probability, and numerical computation. Basics of machine learning and feature engineering. Neural networks. Tutorial on tools. Deep learning networks: shallow vs. deep networks, deep feedforward networks, gradient-based learning (cost functions, softmax, sigmoid function), hidden units (ReLU, logistic sigmoid, hyperbolic tangent), architecture design, SGD. Unsupervised learning: Deep Belief Network, Deep Boltzmann Machine, factor analysis, autoencoders. Regularization. Optimization for training deep models. Advanced topics: Convolutional Neural Network, Recurrent Neural Network / sequence modeling, LSTM, reinforcement learning. Practical applications: vision, speech, NLP, etc.

  • Ian Goodfellow, Yoshua Bengio, and Aaron Courville, “Deep Learning”, MIT Press, 2016. (available online)
  • Trevor Hastie, Robert Tibshirani, and Jerome H. Friedman, “The Elements of Statistical Learning”, Springer Series in Statistics, 2009.
  • Charu C. Aggarwal, “Neural Networks and Deep Learning”, Springer, 2018.

Topics           Slides   Annotated Slides
Introduction     pdf      NA
Linear algebra   pdf      NA

Last modified: 2023/01/21 10:17 by arijit