Same school, new name. The School of Informatics and Computing is changing its name effective January 11, 2023. Learn more about the name change

INFO-I 428 Web Mining

3 credits

  • Prerequisites: INFO-B 210 or CSCI-A 204 or CSCI 23000
  • Delivery: On-Campus, Online
  • This course covers concepts and methods used to search the web and other sources of unstructured text from a human-centered standpoint. These include document indexing, crawling, classification, and clustering; distance metrics; analyzing streaming data, such as social media; link analysis; and system evaluation.

    Learning Outcomes

    1. Implement web search concepts and methods to return documents automatically based on user queries.
    2. Design and implement a crawler application to collect and index documents from the web.
    3. Design computational methods to classify documents by topic.
    4. Use distance metrics to compute the similarity of pairs of documents.
    5. Create a system to collect and analyze streaming data.
    6. Use link analysis to rank web search results.
    7. Evaluate the performance of web search systems.
    8. Analyze text to determine the reliability of the information including potential bias.

     

    Syllabi

    There is not a syllabus available for this course.