Apache Hadoop
Quick Overview
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware ...
- Added By: mohamed
- Categories: Software Development
- Tags: MapReduce Cloud Big Data Large Scale Distributed File System
Dispy
dispy is a framework, developed in Python, for parallel execution of computations by distributing them across multiple processors on a single machine (SMP), among many machines in a cluster, grid ...
Categories: Software Tags: Python Distributed Computing Clustering
HPCC
HPCC (High-Performance Computing Cluster), also known as DAS (Data Analytics Supercomputer), is an open source Data Intensive Computing system platform developed by LexisNexis Risk Solutions. The HPCC platform incorporates a ...
Categories: Software Tags: Cloud Computing Business Intelligence data-analysis big-data machine-learning predictive-analytics
Elastic Phoenix
Phoenix is a shared-memory implementation of the MapReduce parallel programming framework. Phoenix was originally developed at Stanford University. One limitation of Phoenix is that once a Phoenix MapReduce application has ...
Categories: Software Tags: Cloud Computing Map Reduce
Plasma MapReduce
PlasmaFS is a distributed filesystem for large files, implemented in user space. Plasma Map/Reduce runs the famous algorithm scheme for mapping and rearranging large files. Plasma KV is a key/value ...
Categories: Software Tags: MapReduce Distributed Filesystem
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high availability, the library itself is designed to detect and handle failures at the application layer, thereby delivering a highly available service on top of a cluster of computers, each of which may be prone to failures.
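As a rough illustration of the kind of "simple programming model" mentioned above, the sketch below imitates the map, shuffle, and reduce steps of a MapReduce-style word count in plain Python. It does not use Hadoop or any Hadoop API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for this example, and everything runs in a single process rather than on a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, the step that sits between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    # In a distributed setting, different machines would map different
    # parts of the input; here all lines are mapped in one process.
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data big cluster", "data cluster data"]))
```

Because each call to `map_phase` depends only on its own line, and each call to `reduce_phase` only on its own key, the work could in principle be spread across many machines, which is the property a framework of this kind relies on.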
| Platforms | Cross-Platform |
|---|---|
| License | Apache License 2.0 |
| Learning Resources | http://bigdatauniversity.com/ |
| Other | |