4 Credits
“Big data” deals with techniques for collecting, processing, analyzing, and acting on data at internet scale: unprecedented speed, scale, and complexity. This course introduces the latest techniques and infrastructures developed for big data, including parallel and distributed database systems, map-reduce infrastructures, scalable platforms for complex data types, stream processing systems, and cloud-based computing. You’ll learn to apply common statistical and machine learning techniques to large data sets. Course content will be a blend of theory, algorithms, and practical, hands-on work.