BSDA5001 - Introduction to Big Data
| Field | Value |
|---|---|
| Course Code | BSDA5001 |
| Level | Degree Level Course |
| Credits | 4 |
| Type | Elective |
| Pre-requisites | None |
| Videos | YouTube Playlist |
📖 Description
This course will introduce students to practical aspects of analytics at a large scale, i.e. big data. The course will start with a basic introduction to big data and cloud concepts spanning hardware, systems and software, and then delve into the details of algorithm design and execution at large scale.
🗓️ Weekly Syllabus
| Week | Topic |
|---|---|
| Week 1 | Introduction: Big data concepts & GCP Platform Setup |
| Week 2 | Cloud concepts: Cloud-Native architecture, serverless computing, message queues, PaaS, SaaS, IaaS |
| Week 3 | Types of Data: Data formats, sources & their semantics, processing & storage options on Cloud. Use of serverless to get started (e.g. Google Cloud F |
| Week 4 | Intro to Big Data Engineering: Hadoop and PySpark |
| Week 5 | ELT: ETL, processing patterns for large data, ETL vs ELT, role of a scheduler |
| Week 6 | SQL & NoSQL: For most analysis tasks, SQL is sufficient. Tools like Spark SQL allow that familiarity to translate to big data solutions. Types of NoSQ |
| Week 7 | Streaming: Overview, Fundamental Concepts, Walkthrough of Google Pub/Sub & Google DataFlow as example technologies |
| Week 8 | Streaming: Kafka as another example of message queue technology & Spark Streaming |
| Week 9 | Big Data ML: DataProc with ML - including Spark ML (Batch processing) |
| Week 10 | Deep Learning with big data on cloud. |
| Week 11 | Prep week for final project, summarizing key concepts, and also for Q&A and clarifications |
📝 About the Instructors
Rangarajan Vasudevan
Co-Founder & Chief Data Officer, Lentra.ai
Rangarajan Vasudevan is the Co-Founder & CDO of Lentra.ai, India’s fastest-growing lending cloud. He did “big data” & “data science” before it was fashionable, building data-native applications across industries and geographies for over 15 years.
Ranga joined Lentra by way of an acquisition in June 2022 of his company TheDataTeam, creators of Cadenz.ai customer intelligence platform. Prior to founding TheDataTeam, Ranga served as Director, Big Data with Teradata Corporation’s international business unit. Ranga joined Teradata via the acquisition of Aster Data Systems, where he was a founding engineer and co-invented a company-defining, patented, pattern recognition algorithm. He is a recipient of both the Distinguished Engineer (R&D) and Consulting Excellence awards while at Teradata.
Ranga has degrees in Computer Science from the University of Michigan and IIT Madras.
Faculty mail : rangarajan[at]study[dot]iitm[ac]in