Computer Science Department

Big Data: Schedule CS535
Spring 2021
Note that this schedule will be updated during the semester. Please check it every week.
Big Data Frameworks: Week 1 - Week 5

Week 1 (1/18-1/24)

Topics
Introduction to Big Data
Course Introduction


Readings, Lecture Notes/ Video
Available in Canvas

CSU Academic Calendar 2020-21 [Link]

Week 2 (1/25-1/31)

Topics
Data processing paradigms for Big Data
Scalable Distributed File Systems: Google File System I and II


Readings, Lecture Notes/ Video
Available in Canvas

Notes
CSU Academic Calendar 2020-21 [Link]


Week 3 (2/1-2/7)

Topics
Distributed Computing Models for Scalable Batch Computing: In-Memory Cluster Computing - Apache Spark (Part A)

Readings, Lecture Notes/ Video
Available in Canvas



Week 4 (2/8-2/14)

Topics
Distributed Computing Models for Scalable Batch Computing: In-Memory Cluster Computing - Apache Spark (Part B)

Readings, Lecture Notes/ Video
Available in Canvas



Week 5 (2/15-2/21)

Topics
Real-time Streaming Computing Models: Apache Storm and Twitter Heron

Readings, Lecture Notes/ Video
Available in Canvas



Guided Exploration for Big Data Analytics Research (GEAR) [Visit the GEAR Session page]
The CS535 Guided Exploration for Big Data Analytics Research (GEAR) sessions provide a guided learning environment for advanced topics in Big Data analytics research. GEAR involves active participation from students. Lectures (up to 75% of the class) cover the fundamental concepts of the targeted topic, and about 25% of the class is based on student-led research discussions in which students provide critical reviews of cutting-edge research papers. These discussions give students an opportunity to apply the knowledge and concepts covered in the lectures to real-world problems and to explore future research directions.
[GEAR Session I] Peta-scale Storage Systems

Duration
Week 6 and 7: 2/22 - 3/7

Topics
Peta-scale Storage Systems
Scalable NoSQL Systems: DHT based key-value storage

Readings, Lecture Notes/ Video
Available in Canvas


[GEAR Session II] Machine Learning for Big Data

Duration
Week 8, 9, and 10: 3/8 - 3/28

Topic
Deep Learning for Big Data
- Deep Learning with PyTorch and TensorFlow

Readings, Lecture Notes/ Video
Available in Canvas

3/24: Term Project (TP) proposal presentation

[GEAR Session III] Big Graph Analytics with Social Media Analysis

Duration
Week 11 and 12: 3/29 - 4/11


Topic
Big Graph Analytics with Social Media Analysis

Readings, Lecture Notes/ Video
Available in Canvas


Week 13: Spring Recess - No Class
[GEAR Session IV] Algorithmic Techniques for Big Data

Duration
Week 14 and 15: 4/19-5/2

Topics
Algorithmic Techniques for Big Data

Readings, Lecture Notes/ Video
Available in Canvas



[Term Project Presentation Week] Week 16

Schedule
TBA