Flink becomes a Top-Level Apache Project
The Apache Software Foundation announced Flink as a Top-Level Apache project. Read more in the ASF’s press release.
Computing Recommendations at Extreme Scale with Apache Flink
Note: This article is a summary of the more detailed article "How to factorize a 700 GB matrix with Apache Flink". Recommender Systems and Matrix Factorization: Recommender Systems are a very...
How to factorize a 700 GB matrix with Apache Flink - A story full of valuable...
This article is a follow-up post to the earlier published article about Computing recommendations at extreme scale with Apache Flink. We discuss how we implemented the alternating least squares (ALS)...
Announcing Google Cloud Dataflow on Flink and easy Flink deployment on Google...
Today, we are pleased to announce a deeper engagement between Google, data Artisans, and the broader Apache Flink community to bring easy Flink deployment to Google Cloud Platform, and enable Google...
Real-time stream processing: The next step for Apache Flink
This post also appears as a guest post at the Confluent blog. Stream processing is becoming very popular with open source projects like Apache Kafka, Apache Samza, Apache Storm, Apache Spark’s...
Stream processing at Bouygues Telecom with Apache Flink
This is a guest post written by Mohamed Amine Abdessemed from Bouygues Telecom. About Bouygues Telecom: Bouygues Telecom is a full-service communication operator (mobile, fixed telephony, TV, Internet,...
High-throughput, low-latency, and exactly-once stream processing with Apache...
The popularity of stream data platforms is skyrocketing. Several companies are transitioning parts of their data infrastructure to a streaming paradigm as a solution to increasing demands for real-time...
How Flink handles backpressure
People often ask us how Flink deals with backpressure effects. The answer is simple: Flink does not use any sophisticated mechanism, because it does not need one. It gracefully responds to backpressure...
Kafka + Flink: A practical, how-to guide
A very common use case for Apache Flink is stream data movement and analytics. More often than not, the data streams are ingested from Apache Kafka, a system that provides durability and pub/sub...
A look at Flink Forward 2015
At data Artisans we are very proud to be part of the organization team behind Flink Forward 2015, the first conference we are organizing around the Apache Flink project. With the agenda of the...
Batch is a special case of streaming
Interested in stream processing? Sign up for Flink Forward 2015, the first conference on Apache Flink. In recent blog posts, we introduced what we deem as requirements for systems to classify as stream...
Announcing Cascading on Flink
See also the related announcement at the Cascading blog. Today we are thrilled to announce the first availability of Cascading on Flink, a result of a community-driven effort that brings together...
How we selected Apache Flink as our Stream Processing Framework at the Otto...
This is a guest post written by Christian Kreutzfeldt (@mnxfst) and Alexander Kolb (@lofifnc) from the Otto Group Business Intelligence Department. The Hamburg-based Otto Group is the world’s...
Reflections on Flink Forward 2015
Flink Forward 2015 was the inaugural conference around the Apache Flink community and took place at the beautiful Kulturbrauerei in Berlin, a former brewery turned into a fantastic event space....
Flink 0.10: A significant step forward in open source stream processing
We are delighted to see that the Flink community has announced the availability of Apache Flink 0.10. The 0.10 release is one of the largest Flink releases ever, with about 80 individuals resolving...
How Apache Flink enables new streaming applications
Stream data processing is booming in popularity, as it promises better insights from fresher data, as well as a radically simplified pipeline from data ingestion to analytics. Data production in the...
Dataflow proposed as Apache Incubator project
Today, we are delighted to see Dataflow publicly proposed as an Apache Incubator project with an initial project team from Google, data Artisans, Cloudera, and others. data Artisans is the second...
Extending the Yahoo! Streaming Benchmark
Until very recently I’ve been working at Twitter and focusing primarily on stream processing systems. While researching the current state-of-the-art in stateful streaming systems I came across Apache...
How Apache Flink enables new streaming applications
This is the second blog post in our series on how Flink enables new streaming applications. The first post introduced event time and out of order processing. This post is about versioning application...
Flink 1.0: General availability and pushing the envelope in open source...
We are delighted to see that the Flink community has announced the availability of Apache Flink 1.0. This release is one of the largest Flink releases ever, with about 64 individuals resolving more...