FLaNK Stack Weekly for 5 September 2023

 

05-September-2023

FLiPN-FLaNK Stack Weekly

Tim Spann @PaaSDev

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

Get your new Apache NiFi for Dummies!

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

**This is Issue #101 **

https://github.com/tspannhw/FLiPStackWeekly

https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

https://www.cloudera.com/solutions/dim-developer.html

My latest talk will be streaming on September 13th on NiFi, Kafka, Flink and LLM.

https://www.cloudera.com/about/events/cloudera-now-cdp.html

Happy Labor Day US!

Flink got added to OSS Chat! https://osschat.io/chat?project=Flink

Releases

NiFi 1.23.2

Recent Talk

https://www.slideshare.net/bunkertor/aidevday-datainmotion-to-supercharge-ai

Articles

https://medium.com/@tspann/streaming-llm-with-apache-nifi-huggingface-ad2f0d367468

https://www.amazon.science/news-and-features/amazon-bedrock-offers-access-to-multiple-generative-ai-models?

https://blog.cloudera.com/new-accreditations-for-cloudera-partners/

https://zilliz.com/blog/building-llm-apps-100x-faster-responses-drastic-cost-reduction-using-gptcache?utm_source=linkedin&utm_medium=social&utm_term=zilliz

https://www.deepmind.com/blog/identifying-ai-generated-images-with-synthid

https://medium.com/cloudera-inc/cloudera-flow-management-2-1-6-df34c3061aaf

https://blog.devgenius.io/building-intuition-retrieval-augmented-generation-vs-fine-tuning-3a275363dec1

https://github.com/estebanpdl/osintgpt

https://towardsdatascience.com/when-change-data-capture-wins-271875e3df1a

https://medium.com/cloudera-inc/collecting-netflow-records-with-cloudera-dataflow-f47d9f57c98

https://docs.cloudera.com/cfm/2.1.6/using-flow-library-registry-client/topics/cfm-adding-flow-library-registry-client.html

https://docs.cloudera.com/cfm/2.1.6/nifi-components/index.html

https://antonz.org/mastering-curl/

Videos

AICamp - AI Dev Day - NYC 2023 - August 23, 2023 - NiFi + LLM https://youtu.be/l0wPG9zXod0?si=Fhy0K0cNwLK29Py8&t=6820

https://youtu.be/B2ORocuzSzM?si=PrHtCM2UZZhT1UZk

https://www.youtube.com/watch?app=desktop&v=fyB8aUgT14w&feature=youtu.be#dialog

https://youtu.be/fyB8aUgT14w?si=AiONqsP0zs0vERzs

Events

September 13, 2023: Cloudera Now https://www.cloudera.com/about/events/cloudera-now-cdp.html?internal_keyplay=ALL&internal_campaign=FY24-Q3_AMER_Cloudera_Now_WEB_H10&cid=701Hr0000025VuVIAU&internal_link=h10

September 14, 2023: SkillUpSeries: Enable a Streaming Change Data Capture (CDC) Solution. Virtual. https://attend.cloudera.com/skillupseriesseptember14

Sept 21, 2023: Sao Paulo, Brazil. Evolve https://br.cloudera.com/about/events/evolve/sao-paulo.html

October 7-10, 2023: Halifax, CA. Community over Code. https://communityovercode.org/

October 8, 2023: Streaming Track, Room 102 https://communityovercode.org/schedule/#Oct8 https://communityovercode.org/schedule-list/#SG007 https://communityovercode.org/schedule-list/#SG011

October 10, 2023: Internet of Things Track, Room 109 https://communityovercode.org/schedule/#Oct10 https://communityovercode.org/schedule-list/#IOT001

October 18, 2023: 2-Hours to Data Innovation: Data Flow https://www.cloudera.com/about/events/hands-on-lab-series-2-hours-to-data-innovation.html

November 1, 2023: Open Source Finance Forum. Virtual. https://events.linuxfoundation.org/open-source-finance-forum-new-york/ November 2, 2023: Evolve. NYC https://www.cloudera.com/about/events/evolve/new-york.html#register

November 7, 2023: XtremeJ 2023. Virtual. https://xtremej.dev/2023/schedule/

November 8, 2023: Flink Forward, Seattle. https://www.flink-forward.org/seattle-2023

November 21, 2023: JCon World. Virtual. https://sched.co/1RRWm

November 22, 2023: Big Data Conference. Hybrid
https://bigdataconference.eu/ https://events.pinetool.ai/3079/#sessions/101077

Cloudera Events https://www.cloudera.com/about/events.html

More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

Code

Tools

Ex-Clouderan, Awesome Guy and Data Scientist Superstar has released some amazing GenAI tools: You must download these!!!

© 2020-2023 Tim Spann

FLaNK Stack Weekly for 28 August 2023

 

28-August-2023

FLiPN-FLaNK Stack Weekly

Tim Spann @PaaSDev

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

Get your new Apache NiFi for Dummies!

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

The 25th was my daughter's birthday, so it was a good weekend. Lots of great things are coming. AI Dev Day in NYC was amazing, over 200 people, lots of speakers and they were so good that I actually learned some LLM, Vector Database and some AI processing. I also got to work with a video crew for some upcoming short items. If you are interested in certain articles, videos, slides or demos please reach out.

cat

CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

**This is Issue #100 **

https://github.com/tspannhw/FLiPStackWeekly

https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

https://www.cloudera.com/solutions/dim-developer.html

My latest talk will be streaming on September 13th on NiFi, Kafka, Flink and LLM.

https://www.cloudera.com/about/events/cloudera-now-cdp.html?utm_medium=email&utm_source=newsletter&keyplay=ALL&utm_campaign=FY24-Q3_AMER_Cloudera_Now_XP_WkEmail&cid=701Hr0000025Vu6IAE

Releases

NiFi 1.23.2

Recent Talk

https://www.slideshare.net/bunkertor/aidevday-datainmotion-to-supercharge-ai

https://www.linkedin.com/feed/update/urn:li:activity:7100451771470249984/

Articles

https://medium.com/@tspann/streaming-llm-with-apache-nifi-huggingface-ad2f0d367468

https://kevinbtalbert.github.io/nifi/nifi-splunk/

https://thenewstack.io/comparing-different-vector-embeddings/

https://www.schemastore.org/json/

https://nightlies.apache.org/flink/flink-docs-release-1.17/docs/dev/python/table/udfs/vectorized_python_udfs/

https://medium.com/cloudera-inc/consume-slacks-events-api-with-cloudera-flow-management-49fed7c2a531

https://newsletter.victordibia.com/p/practical-steps-to-reduce-hallucination?utm_campaign=post&utm_medium=web

http://www.tidepool.so/2023/08/17/why-you-probably-dont-need-to-fine-tune-an-llm/

https://dzone.com/articles/integration-testing-of-non-blocking-retries-with-s

https://thenewstack.io/what-do-java-developers-think-of-the-rise-of-genai/

https://medium.com/cloudera-inc/building-an-effective-nifi-flow-queryrecord-cca5ba51afd5

https://medium.com/@deephavendatalabs/a-high-performance-csv-reader-with-type-inference-4bf2e4baf2d1

https://www.alibabacloud.com/blog/all-you-need-to-know-about-pyflink_600306

https://flink.apache.org/2023/08/04/announcing-three-new-apache-flink-connectors-the-new-connector-versioning-strategy-and-externalization/

Events

https://attend.cloudera.com/ameropendatalakehousewithcdpon?lid=7vxyhds3tlv7

Sept 21, 2023: Sao Paulo, Brazil. Evolve https://br.cloudera.com/about/events/evolve/sao-paulo.html

October 7-10, 2023: Halifax, CA. Community over Code. https://communityovercode.org/

October 8, 2023: Streaming Track, Room 102 https://communityovercode.org/schedule/#Oct8 https://communityovercode.org/schedule-list/#SG007 https://communityovercode.org/schedule-list/#SG011

October 10, 2023: Internet of Things Track, Room 109 https://communityovercode.org/schedule/#Oct10 https://communityovercode.org/schedule-list/#IOT001

October 18, 2023: 2-Hours to Data Innovation: Data Flow https://www.cloudera.com/about/events/hands-on-lab-series-2-hours-to-data-innovation.html

November 1, 2023: Open Source Finance Forum. Virtual. https://events.linuxfoundation.org/open-source-finance-forum-new-york/ November 2, 2023: Evolve. NYC https://www.cloudera.com/about/events/evolve/new-york.html#register

November 7, 2023: XtremeJ 2023. Virtual. https://xtremej.dev/2023/schedule/

November 8, 2023: Flink Forward, Seattle. https://www.flink-forward.org/seattle-2023

November 22, 2023: Big Data Conference. Hybrid
https://bigdataconference.eu/ https://events.pinetool.ai/3079/#sessions/101077

Cloudera Events https://www.cloudera.com/about/events.html

More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

Code

Tools

Tool to validate Avro Schemas Online! http://avro.tarantool.org/#

© 2020-2023 Tim Spann