BSDA5001 - Introduction to Big Data

Field | Value
Course Code | BSDA5001
Level | Degree Level Course
Credits | 4
Type | Elective
Pre-requisites | None
Videos | YouTube Playlist

📖 Description

This course introduces students to the practical aspects of large-scale analytics, i.e. big data. It starts with a basic introduction to big data and cloud concepts spanning hardware, systems, and software, and then delves into the details of algorithm design and execution at large scale.

🗓️ Weekly Syllabus

Week | Topic
Week 1 | Introduction: Big data concepts & GCP Platform Setup
Week 2 | Cloud concepts: Cloud-Native architecture, serverless computing, message queues, PaaS, SaaS, IaaS
Week 3 | Types of Data: Data formats, sources & their semantics, processing & storage options on Cloud. Use of serverless to get started (e.g. Google Cloud F
Week 4 | Intro to Big Data Engineering: Hadoop and PySpark
Week 5 | ELT: ETL, processing patterns for large data, ETL vs ELT, role of a scheduler
Week 6 | SQL & NoSQL: For most analysis tasks, SQL is sufficient. Tools like Spark SQL allow that familiarity to translate to big data solutions. Types of NoSQ
Week 7 | Streaming: Overview, Fundamental Concepts, Walkthrough of Google Pub/Sub & Google DataFlow as example technologies
Week 8 | Streaming: Kafka as another example of message queue technology & Spark Streaming
Week 9 | Big Data ML: DataProc with ML - including Spark ML (Batch processing)
Week 10 | Deep Learning with big data on cloud.
Week 11 | Prep week for final project, summarizing key concepts, and also for Q&A and clarifications

📝 About the Instructors

Rangarajan Vasudevan
Co-Founder & Chief Data Officer, Lentra.ai
Rangarajan Vasudevan is the Co-Founder & CDO of Lentra.ai, India’s fastest-growing lending cloud. He did “big data” & “data science” before they were fashionable, building data-native applications across industries and geographies over 15+ years.
Ranga joined Lentra through its June 2022 acquisition of his company TheDataTeam, creators of the Cadenz.ai customer intelligence platform. Prior to founding TheDataTeam, Ranga served as Director, Big Data with Teradata Corporation’s international business unit. Ranga joined Teradata via the acquisition of Aster Data Systems, where he was a founding engineer and co-invented a company-defining, patented pattern recognition algorithm. While at Teradata, he received both the Distinguished Engineer (R&D) and Consulting Excellence awards.
Ranga has degrees in Computer Science from the University of Michigan and IIT Madras.
Faculty mail: rangarajan[at]study[dot]iitm[ac]in
