Goto

Collaborating Authors

 realtime applicationbatch application simple analysis


Scalable, Distributed, Deep Machine Learning for Big Data

#artificialintelligence

Apache Thrift The Thrift stack is a common class hierarchy implemented in each language that abstracts out the tricky details of protocol encoding and network communication 26. Chukwa A data collection system for monitoring large distributed systems; Provides flexible/powerful toolkit to display, monitor, and analyze results; Architecture: Agents - run on each machine and emit data; Collectors - receive data from the agent and write it to stable storage; MapReduce jobs - parsing and archiving the data; Hadoop Infrastructure Care Center - a web-portal style interface.