[wenceslao.palma@pucv.cl bigDataMining]$ ls
email: wenceslao.palma@pucv.cl, hector.allende@pucv.cl
Programa de la asignatura
Apuntes
Big data: intro
Big Data: MapReduce
Big Data: HDFS & MapReduce internals
Big Data: Hadoop code examples source code wordCount (python)
Big Data: Mapreduce and Parallel DBMS
Big Data: Pig source code
Big Data: Spark word count
Big Data: Data Streams Management Systems
Big Data: stream processing with Spark streaming streaming windows
Big Data: NoSQL
Big Data: Hive
Big Data Mining: introduction
Tareas
Tarea #1: Efficient Skyline Computation in MapReduce
Papers
Lecturas y links de interes
The promise of big data
Eight (No, Nine!) Problems With Big Data
MapReduce: Simplified Data Processing on Large Clusters
MapReduce: A Flexible Data Processing Tool