Same school, new name. The School of Informatics and Computing is changing its name effective January 11, 2023. Learn more about the name change

INFO-H 518 Deep Learning Neural Networks

3 credits

  • Prerequisites: Linear algebra, probability and statistics, partial derivatives, and programming. Note: Programming is in Python
  • Delivery: On-Campus
  • Deep learning has resurged with the availability of massive datasets and affordable computing, enabling new applications in computer vision and natural language processing. This course introduces convolutional, recurrent, and other neural network architectures for deep learning. Students design, implement, and train these models to solve real-world problems.

    Learning Outcomes

    1. Solve problems in linear algebra, probability, optimization, and machine learning.
    2. Evaluate, in the context of a case study, the advantages and disadvantages of deep learning neural network architectures and other approaches.
    3. Implement deep learning models in Python using the PyTorch library and train them with real-world datasets.
    4. Design convolution networks for handwriting and object classification from images or video.
    5. Design recurrent neural networks with attention mechanisms for natural language classification, generation, and translation.
    6. Evaluate the performance of different deep learning models (e.g., with respect to the bias-variance trade-off, overfitting and underfitting, estimation of test error).
    7. Perform regularization, training optimization, and hyperparameter selection on deep models.
    8. Analyze a deep learning model’s hardware node and GPU scalability in preparation for deployment.


    INFO H611