Parallel computing
Distributed computing architecture
Cloud infrastructure
Data-intensive computing
MapReduce
Apache Hadoop
Data
Sampling
Parallelization contract