Are people in your data analytics organization contemplating the impending data avalanche from the internet of things and thus asking this question: "Spark or Hadoop?" The internet of things (IOT) will generate massive quantities of data. In most cases, these will be streaming data from ubiquitous sensors and devices. Often, we will need to make real-time (or near real-time) decisions based off of this tsunami of data inputs. How will we efficiently manage all of this, make effective use of it, and become lord over it before it becomes lord over us?
Oct-14-2017, 16:10:19 GMT