Upcoming Events 2021

ApacheCon Asia - 06-August-2021
PRO TALK: Continuous SQL with Kafka and Flink
Scenic City Summit - 24-September-2021
ApacheCon 2021 - 21-September-2021 to 23-September-2021
    Tuesday 17:10 UTC - Apache NiFi Deep Dive 300
    Tuesday 18:00 UTC - Apache Deep Learning 302
    Wednesday 15:00 UTC - Smart Transit: Real-Time Transit Information with FLaNK
    Wednesday 17:10 UTC - Cracking the Nut, Solving Edge AI with Apache Tools and Frameworks
    Thursday 14:10 UTC - Apache NiFi 101: Introduction and Best Practices
Big Data Conference EU - 28-September-2021 to 29-September-2021
API World - 26-October-2021 to 28-October-2021

NiFi on Cloudera Data Platform Upgrade - April 2021

CFM 2.1.1 on CDP 7.1.6 There is a new Cloudera release of Apache NiFi, now with SAML support. Apache NiFi Apache NiFi Registry See: For changes: Get your download on: To start researching for the future, take a look at some of the technical preview features around the Easy Rules engine and handlers. Make sure you use the latest possible JDK 8, as there are some bugs out there. Use a recent v

Populating Your Secure Cloud Data Estates

Hydrating Your Clean Cloud Data Lake I am hard pressed to keep up with the Data Store + Query terminology du jour. Was it Data Lake House? All these giant bodies of water mostly stored in buckets (S3)? I agree there are lots of nuances and many different query engines on top of those various means for storing that data. I don't think every time we add a twist we need to add increasingly silly terms on top. Is it to confuse users? Developers? Data engineers? Companies? Executives? Perhaps if we change our data warehouse name again we can get them to buy the same thing again. Clearly it can't be one size fits all for all these different things? I know a lot of companies of various types and sizes, and most don't approach the size of the data that companies like Netflix and LinkedIn have. I really like their innovation, but often those projects get released and then wither in obscurity. A few projects look really good: A

Cloudera SQL Stream Builder (SSB) - Update Your FLaNK Stack

Cloudera SQL Stream Builder (SSB) Released! CSA 1.3.0 is now available with Apache Flink 1.12 and SQL Stream Builder! Check out this white paper for some details. You can get full details on the Stream Processing and Analytics available from Cloudera here. This is an awesome way to query Kafka topics with continuous SQL that is deployed to scalable Flink nodes in YARN or K8s. We can also easily define functions in JavaScript to enhance, enrich and augment our data streams. No Java to write, no heavy deploys or build scripts; we can build, test and deploy these advanced streaming applications all from your secure browser interface. References: https://do