Same school, new name. The School of Informatics and Computing is changing its name effective January 11, 2023. Learn more about the name change

INFO-I 418 Deep Learning Neural Networks

3 credits

  • Prerequisites: INFO-B 210 or CSCI-A 201 or CSCI 23000; Recommended: PBHL-B 302 (or equivalent) or INFO-I 415
  • Delivery: On-Campus
  • Deep learning has resurged with the availability of massive datasets and affordable computing, enabling new applications in computer vision and natural language processing. This course introduces convolutional, recurrent, and other neural network architectures for deep learning. Students design, implement, and train these models to solve real-world problems.

    Learning Outcomes

    1. Solve problems in linear algebra, probability, optimization, and machine learning.
    2. Evaluate, in the context of a case study, the advantages and disadvantages of deep learning neural network architectures and other approaches.
    3. Implement deep learning models in Python using the PyTorch and TensorFlow libraries and train them with real-world datasets.
    4. Design convolution networks for handwriting and object classification from images or videos.
    5. Design recurrent neural networks with attention mechanisms for natural language classification, generation, and translation.
    6. Evaluate the performance of different deep learning models (e.g., with respect to the bias-variance trade-off, overfitting and underfitting, estimation of test error).
    7. Perform regularization, training optimization, and hyperparameter selection on deep models.
    8. Analyze a deep learning model’s hardware node and GPU scalability in preparation for deployment.

    Syllabi

    There is not a syllabus available for this course.