Registry Synced

BSDA5003 - Algorithms for Data Science (ADS)

428 words
2 min read
FieldValue
Course CodeBSDA5003
LevelDegree Level Course
Credits4
TypePre-requisites:
Pre-requisitesBSCS2007 - Β Machine Learning Techniques

πŸ“– Description

The aim of this second-level graduate course is to provide a broad overview and develop the tools and methods necessary for the large-scale problems that naturally arise in many data science-related application areas.

πŸ—“οΈ Weekly Syllabus

WeekTopic
Week 1Foundations of Randomized Methods & Concentration Inequalities
Week 2Randomized SVD – I: Basics & Sampling Techniques
Week 3Randomized SVD – II: Applications to PCA & Dimensionality Reduction
Week 4Graph-Based Learning – I: Spectral Graph Theory, Clustering, Community Detection
Week 5Graph-Based Learning – II: Graph-Based Ranking
Week 6Dimension Reduction with Johnson-Lindenstrauss Lemma
Week 7Approximate Nearest Neighbors (ANN) – I: LSH & Similarity Search
Week 8Approximate Nearest Neighbors (ANN) – II: MinHash, SimHash, Bloom Filters
Week 9Randomized Methods for Regression
Week 10Matrix Sketching for Machine Learning
Week 11Streaming Algorithms – I: Count-Min Sketch, Heavy Hitters, Frequency Moments
Week 12Streaming Algorithms – II: Reservoir Sampling, Graph Streams, Streaming PCA

πŸ“š Books & Resources

Prescribed Books The following are the suggested books for the course:
        A. Blum, J. Hopcroft, and R. Kannan (2020) Foundations of Data Sciences, Cambridge  University Press
        
        M. W. Mahoney (2010) Randomized Algorithms for Matrix and Data, Foundations and Trends in Machine Learning, pages 123-224

πŸ“ About the Instructors

Arun Rajkumar
Assistant Professor,
Department of Data Science and AI,
IIT Madras
I am currently an Assistant Professor at the Data Science and AI department of IIT Madras. Prior to joining IIT Madras, I was a research scientist at the Xerox Research Center (now Conduent Labs), Bangalore for three years. I earned my Ph.D from the Indian Institute of Science where I worked on 'Ranking from Pairwise Comparisons'. My research interests are in the areas of Machine learning, statistical learning theory with applications to education and healthcare.
less
Visit website
Other courses by the same instructor:
BSCS2004 -
Machine Learning Foundations
,
BSCS2007 -
Machine Learning Techniques
and
BSDA5007 -
Special topics in Machine Learning (Reinforcement Learning)

Document Outline
Table of Contents
System Normal // Awaiting Context

Intelligence Hub

Navigate the knowledge graph to generate context. The Hub adapts dynamically to surface backlinks, related notes, and metadata insights.