[wenceslao.palma@ucv.cl bigData]$ ls
Course syllabus
Lecture notes
Intro: MapReduce
Intro: MapReduce (cont.)
MapReduce and Parallel DBMS
MapReduce: Hadoop
Hadoop: developing apps source code
Pig
Data Stream Management Systems
Query processing (review)
DSMS: query operators (join)
DSMS: sliding windows, CQL
Assignments
Assignment #1: An Efficient MapReduce Algorithm for Counting Triangles in a Very Large Graph
Triangle Listing in Massive Networks and Its Applications (basic triangle-counting algorithm: Algorithm 1, In-Memory Triangle Listing).
Papers
Processing Interval Joins on MapReduce (Agustin Salas)
SQL-on-Hadoop: Full Circle Back to Shared-Nothing Database Architectures (Sebastian Mansilla)
Scientific Computing Meets Big Data Technology: An Astronomy Use Case (Alvaro Gomez)
Spark SQL: Relational Data Processing in Spark (Adrian Jaramillo)
Readings and links of interest
MapReduce: Simplified Data Processing on Large Clusters
MapReduce: A Flexible Data Processing Tool
Spark
Spark Streaming