Big Data Now: 2014 Edition by O'Reilly Media Inc

Author: O'Reilly Media, Inc.
Language: eng
Format: epub, mobi
Publisher: O'Reilly Media, Inc.
Published: 2015-01-15T05:00:00+00:00


Spark users often explore Spark Streaming because code written for batch jobs (Spark) can, with minor modifications, be reused for realtime computations (Spark Streaming). Along these lines, Summingbird, an open source library from Twitter, offers something similar for Hadoop MapReduce and Storm. With Summingbird, programs that look like Scala collection transformations can be executed in batch (Scalding) or realtime (Storm).
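To make the "write once, run in batch or realtime" idea concrete, here is a minimal sketch in plain Python. It does not use the Spark, Spark Streaming, or Summingbird APIs; the names `count_words`, `run_batch`, and `run_streaming` are hypothetical, and the "stream" is simulated as a sequence of small in-memory batches. The point it illustrates is that the same counting logic can serve both modes when partial results can be merged associatively.

```python
from collections import Counter
from typing import Iterable, Iterator, List


def count_words(lines: Iterable[str]) -> Counter:
    """Shared logic: split lines into words, then sum counts per word."""
    counts: Counter = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts


def run_batch(dataset: List[str]) -> Counter:
    """Batch mode: apply the shared logic once to the full dataset."""
    return count_words(dataset)


def run_streaming(micro_batches: Iterable[List[str]]) -> Iterator[Counter]:
    """Streaming mode (simulated): apply the same logic to each small batch
    and merge the partial counts, yielding a running total each time."""
    running: Counter = Counter()
    for batch in micro_batches:
        # Counter addition is associative, so partial results merge safely.
        running = running + count_words(batch)
        yield running


# Hypothetical sample data, used only for illustration.
lines = ["spark streaming", "spark batch", "storm streaming"]
batch_result = run_batch(lines)
stream_results = list(run_streaming([lines[:1], lines[1:]]))
```

Once the simulated stream has delivered every line, the last running total in `stream_results` matches `batch_result`. Only the driver differs between the two modes; the transformation itself is unchanged.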

In some instances, the techniques underlying one set of tools make their way into others. The DeepDive team at Stanford recently revamped its information extraction and natural language understanding system. Techniques used in DeepDive have already found their way into many other systems, including MADlib, Cloudera Impala, “a product from Oracle,” and Google Brain.


