APACHE METRON INCUBATING AS A CASE STUDY OF A MODERN STREAMING ARCHITECTURE ON HADOOP
DataWorks Summit

Published on Jul 15, 2017

Many voices have discussed how to architect streaming applications on
Hadoop, but until now very few worked examples have existed in open
source. Apache Metron (Incubating) is a streaming advanced-analytics
cybersecurity application that uses the components of the Hadoop stack
as its platform.

Link to Slides: https://www.slideshare.net/Hadoop_Sum...

We will attempt to go beyond theoretical discussions of Kappa vs. Lambda
architectures and describe the nuts and bolts of a streaming
architecture that enables advanced analytics in Hadoop. We will discuss
the components that we had to build and those that we could reuse, why
we made the architectural decisions that we made, and how the pieces fit
together into a coherent application on top of many different Hadoop
ecosystem projects.

We will also discuss the domain-specific language that we created out of
necessity to provide a pluggable layer for user-defined enrichments, and
how it helped make Metron less rigid and easier to use. Finally, we will
candidly discuss the mistakes that we made early on.

Speaker:
CASEY STELLA
Principal Software Engineer/Data Scientist
Hortonworks

Link to event session page: https://dataworkssummit.com/san-jose-...
