Collaborating Authors

DataStax reconnects with Apache Cassandra


Breaking up is hard to do. A couple years ago, DataStax vacated leadership of the Apache Cassandra project that it created as the community demanded a bigger voice. New players had to move into new roles, while the annual community event was allowed to lapse. Members of the community voiced their angst. As the Apache Cassandra community ventures down the homestretch of readying release 4.0 of the open source platform for release, there is light at the end of the tunnel as DataStax has started reconnecting with the community.

Officials Work to Save Native Apache Trout From Extinction

U.S. News

Arizona Game & Fish employees Bryan Giordano and Mike Lopez are tasked with the Apache trout's survival in Pinetop. They will walk up shallow streams in tandem, Giordano wearing a large electrofishing backpack plugged into a pole he sweeps in the water. Lopez walks close behind with a net, waiting for Giordano to stun an Apache trout they can catch, examine and safely release.

SF Spark and Friends


This is a developer-centric meetup focused on Apache Spark, Apache Flink, Apache Kafka, Apache Mesos, related Typesafe and Twitter OSS stacks, and broader distributed Data Science and Machine Learning. How it may be complementary to the original Spark Users, now Bay Area Spark Meetup: Spark in its end-to-end ecosystem -- Mesos, Akka, Kafka, Cassandra, etc., with focus on what works for the final goals of the whole pipeline. We will teach you how to use Scala for Spark to make you more effective, and consider devops options so you can get to production faster. We'll invite projects relevant to or inspired by Apache Spark, such as Apache Storm, Apache Flink, and others, and will be focused on putting together useful OSS as a system.

One simple chart: Who is interested in Apache Pulsar?


For more on Apache Kafka, Apache Pulsar, Apache Spark, and other data technologies, attend the "Data Engineering & Architecture" sessions at the Strata Data Conference in New York City, September 23-26, 2019. With companies producing data from an increasing number of systems and devices, messaging and event streaming solutions--particularly Apache Kafka--have gained widespread adoption. Over the past year, we've been tracking the progress of Apache Pulsar (Pulsar), a less well-known but highly capable open source solution originated by Yahoo. Pulsar is designed to intelligently process, analyze, and deliver data from an expanding array of services and applications, and thus it fits nicely into modern data platforms. Pulsar is also designed to ease the operational burdens normally associated with complex, distributed systems.

What is Apache Spark?


Mike Olson, Chief Strategy Officer and Co-Founder at Cloudera, explains Apache Spark's origins, its rise in popularity in the open source community, and how Spark is primed to replace MapReduce as the general processing engine in Hadoop.