Apache Hadoop

Quick Overview

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware ... Find out more



Dispy

dispy is a framework, developed in Python, for parallel execution of computations by distributing them across multiple processors on a single machine (SMP), among many machines in a cluster, grid ...


Categories: Software Tags: Python Distributed Computing Clustering

HPCC

HPCC (High-Performance Computing Cluster), also known as DAS (Data Analytics Supercomputer), is an open source Data Intensive Computing system platform developed by LexisNexis Risk Solutions. The HPCC platform incorporates a ...


Categories: Software Tags: Cloud Computing Business Intelligence data-analysis big-data machine-learning predictive-analytics

Elastic Phoenix

Phoenix is a shared-memory implementation of the MapReduce parallel programming framework. Phoenix was originally developed at Stanford University. One limitation of Phoenix is that once a Phoenix MapReduce application has ...


Categories: Software Tags: Cloud Computing Map Reduce

Plasma MapReduce

PlasmaFS is a distributed filesystem for large files, implemented in user space. Plasma Map/Reduce runs the famous algorithm scheme for mapping and rearranging large files. Plasma KV is a key/value ...


Categories: Software Tags: MapReduce Distributed Filesystem

New Comment
There are no comments to this product.

New Question
There are no questions to this product.

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-avaiability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-availabile service on top of a cluster of computers, each of which may be prone to failures.

Platforms Cross-Platform
License Apache License 2.0
Learning Resources http://bigdatauniversity.com/
Other

Find us on facebook

Find us on twitter

NewsLetter