FLaNK Stack Weekly for 5 September 2023
05-September-2023
FLiPN-FLaNK Stack Weekly
Tim Spann @PaaSDev
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
Get your new Apache NiFi for Dummies!
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
**This is Issue #101 **
https://github.com/tspannhw/FLiPStackWeekly
https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
https://www.cloudera.com/solutions/dim-developer.html
My latest talk will be streaming on September 13th on NiFi, Kafka, Flink and LLM.
https://www.cloudera.com/about/events/cloudera-now-cdp.html
Happy Labor Day US!
Flink got added to OSS Chat! https://osschat.io/chat?project=Flink
Releases
NiFi 1.23.2
Recent Talk
https://www.slideshare.net/bunkertor/aidevday-datainmotion-to-supercharge-ai
Articles
https://medium.com/@tspann/streaming-llm-with-apache-nifi-huggingface-ad2f0d367468
https://blog.cloudera.com/new-accreditations-for-cloudera-partners/
https://www.deepmind.com/blog/identifying-ai-generated-images-with-synthid
https://medium.com/cloudera-inc/cloudera-flow-management-2-1-6-df34c3061aaf
https://github.com/estebanpdl/osintgpt
https://towardsdatascience.com/when-change-data-capture-wins-271875e3df1a
https://medium.com/cloudera-inc/collecting-netflow-records-with-cloudera-dataflow-f47d9f57c98
https://docs.cloudera.com/cfm/2.1.6/nifi-components/index.html
https://antonz.org/mastering-curl/
Videos
AICamp - AI Dev Day - NYC 2023 - August 23, 2023 - NiFi + LLM https://youtu.be/l0wPG9zXod0?si=Fhy0K0cNwLK29Py8&t=6820
https://youtu.be/B2ORocuzSzM?si=PrHtCM2UZZhT1UZk
https://www.youtube.com/watch?app=desktop&v=fyB8aUgT14w&feature=youtu.be#dialog
https://youtu.be/fyB8aUgT14w?si=AiONqsP0zs0vERzs
Events
September 13, 2023: Cloudera Now https://www.cloudera.com/about/events/cloudera-now-cdp.html?internal_keyplay=ALL&internal_campaign=FY24-Q3_AMER_Cloudera_Now_WEB_H10&cid=701Hr0000025VuVIAU&internal_link=h10
September 14, 2023: SkillUpSeries: Enable a Streaming Change Data Capture (CDC) Solution. Virtual. https://attend.cloudera.com/skillupseriesseptember14
Sept 21, 2023: Sao Paulo, Brazil. Evolve https://br.cloudera.com/about/events/evolve/sao-paulo.html
October 7-10, 2023: Halifax, CA. Community over Code. https://communityovercode.org/
October 8, 2023: Streaming Track, Room 102 https://communityovercode.org/schedule/#Oct8 https://communityovercode.org/schedule-list/#SG007 https://communityovercode.org/schedule-list/#SG011
October 10, 2023: Internet of Things Track, Room 109 https://communityovercode.org/schedule/#Oct10 https://communityovercode.org/schedule-list/#IOT001
October 18, 2023: 2-Hours to Data Innovation: Data Flow https://www.cloudera.com/about/events/hands-on-lab-series-2-hours-to-data-innovation.html
November 1, 2023: Open Source Finance Forum. Virtual. https://events.linuxfoundation.org/open-source-finance-forum-new-york/ November 2, 2023: Evolve. NYC https://www.cloudera.com/about/events/evolve/new-york.html#register
November 7, 2023: XtremeJ 2023. Virtual. https://xtremej.dev/2023/schedule/
November 8, 2023: Flink Forward, Seattle. https://www.flink-forward.org/seattle-2023
November 21, 2023: JCon World. Virtual. https://sched.co/1RRWm
November 22, 2023: Big Data Conference. Hybrid
https://bigdataconference.eu/ https://events.pinetool.ai/3079/#sessions/101077
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
Code
Tools
- https://github.com/finos/morphir
- https://issues.apache.org/jira/browse/NIFI-8650
- https://solr.apache.org/guide/solr/latest/query-guide/dense-vector-search.html
- https://github.com/runreveal/kawa
- https://github.com/kousen/openaidemo
- https://picogen.io/
- https://github.com/Barre/privaxy
- https://github.com/facebookresearch/llama
- https://huggingface.co/meta-llama
- https://localai.io/
- https://nightlies.apache.org/flink/flink-ml-docs-release-2.3/docs/try-flink-ml/java/build-your-own-project/
- https://softwaremill.com/zookeeper-less-kafka/
- https://www.odbms.org/2023/08/building-llm-apps-with-100x-faster-responses-and-drastic-cost-reduction-using-gptcache/
- https://exceptionfactory.com/posts/2023/08/29/introducing-jagged-for-age-encryption-in-java/
- https://github.com/exceptionfactory/jagged
- https://github.com/magic-research/magic-edit
- https://github.com/google/paxml
- https://github.com/facebookresearch/co-tracker
- https://github.com/RizwanMunawar/yolov7-object-tracking
- https://github.com/zama-ai/concrete-ml
- https://github.com/Paulescu/hands-on-train-and-deploy-ml
- https://github.com/obra/Youtube2Webpage
- http://pepijndevos.nl/2023/07/15/chatlmza.html
- https://github.com/measuredco/puck
Ex-Clouderan, Awesome Guy and Data Scientist Superstar has released some amazing GenAI tools: You must download these!!!
© 2020-2023 Tim Spann
FLaNK Stack Weekly for 28 August 2023
28-August-2023
FLiPN-FLaNK Stack Weekly
Tim Spann @PaaSDev
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
Get your new Apache NiFi for Dummies!
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
The 25th was my daughter's birthday, so it was a good weekend. Lots of great things are coming. AI Dev Day in NYC was amazing, over 200 people, lots of speakers and they were so good that I actually learned some LLM, Vector Database and some AI processing. I also got to work with a video crew for some upcoming short items. If you are interested in certain articles, videos, slides or demos please reach out.
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
**This is Issue #100 **
https://github.com/tspannhw/FLiPStackWeekly
https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
https://www.cloudera.com/solutions/dim-developer.html
My latest talk will be streaming on September 13th on NiFi, Kafka, Flink and LLM.
Releases
NiFi 1.23.2
Recent Talk
https://www.slideshare.net/bunkertor/aidevday-datainmotion-to-supercharge-ai
https://www.linkedin.com/feed/update/urn:li:activity:7100451771470249984/
Articles
https://medium.com/@tspann/streaming-llm-with-apache-nifi-huggingface-ad2f0d367468
https://kevinbtalbert.github.io/nifi/nifi-splunk/
https://thenewstack.io/comparing-different-vector-embeddings/
https://www.schemastore.org/json/
https://medium.com/cloudera-inc/consume-slacks-events-api-with-cloudera-flow-management-49fed7c2a531
http://www.tidepool.so/2023/08/17/why-you-probably-dont-need-to-fine-tune-an-llm/
https://dzone.com/articles/integration-testing-of-non-blocking-retries-with-s
https://thenewstack.io/what-do-java-developers-think-of-the-rise-of-genai/
https://medium.com/cloudera-inc/building-an-effective-nifi-flow-queryrecord-cca5ba51afd5
https://medium.com/@deephavendatalabs/a-high-performance-csv-reader-with-type-inference-4bf2e4baf2d1
https://www.alibabacloud.com/blog/all-you-need-to-know-about-pyflink_600306
Events
https://attend.cloudera.com/ameropendatalakehousewithcdpon?lid=7vxyhds3tlv7
Sept 21, 2023: Sao Paulo, Brazil. Evolve https://br.cloudera.com/about/events/evolve/sao-paulo.html
October 7-10, 2023: Halifax, CA. Community over Code. https://communityovercode.org/
October 8, 2023: Streaming Track, Room 102 https://communityovercode.org/schedule/#Oct8 https://communityovercode.org/schedule-list/#SG007 https://communityovercode.org/schedule-list/#SG011
October 10, 2023: Internet of Things Track, Room 109 https://communityovercode.org/schedule/#Oct10 https://communityovercode.org/schedule-list/#IOT001
October 18, 2023: 2-Hours to Data Innovation: Data Flow https://www.cloudera.com/about/events/hands-on-lab-series-2-hours-to-data-innovation.html
November 1, 2023: Open Source Finance Forum. Virtual. https://events.linuxfoundation.org/open-source-finance-forum-new-york/ November 2, 2023: Evolve. NYC https://www.cloudera.com/about/events/evolve/new-york.html#register
November 7, 2023: XtremeJ 2023. Virtual. https://xtremej.dev/2023/schedule/
November 8, 2023: Flink Forward, Seattle. https://www.flink-forward.org/seattle-2023
November 22, 2023: Big Data Conference. Hybrid
https://bigdataconference.eu/ https://events.pinetool.ai/3079/#sessions/101077
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
Code
- https://github.com/tspannhw/FLaNK-HuggingFace-BLOOM-LLM
- https://github.com/tspannhw/FLaNK-HuggingFace-DistilBert-SentimentAnalysis
- https://github.com/tspannhw/FLaNK-Edge-Models
- https://github.com/tspannhw/FLaNK-Halifax
Tools
- https://www.philschmid.de/cdk-llama2
- https://github.com/a16z-infra/ai-town
- https://github.com/getumbrel/llama-gpt
- https://huggingface.co/google/flan-ul2
- https://cwiki.apache.org/confluence/display/NIFI/Release+Notes#ReleaseNotes-Version1.23.2
- https://github.com/roboflow/supervision
- https://github.com/truera/trulens
- https://github.com/tin2tin/Generative_AI
- https://mough.xyz/312/psa-add-dir-auto-to-your-inputs-and-textareas
- https://github.com/Dicklesworthstone/fast_vector_similarity
- https://github.com/facebookresearch/seamless_communication
- https://github.com/facefusion/facefusion
- https://github.com/miguelaeh/pipeless
- https://github.com/ricklamers/shell-ai
- https://github.com/opencopilotdev/opencopilot
- https://github.com/chrieke/prettymapp
- https://jacobin.org/
- https://landrop.app/
- https://payload.app/
- https://webwormhole.io/
- https://github.com/mljar/automl-app
- https://github.com/abyildirim/inst-inpaint
- https://github.com/dai-shi/excalidraw-animate
- https://github.com/QwenLM/Qwen-VL
- https://github.com/neulab/prompt2model
- https://druid77.github.io/trs-gpt/
- https://github.com/n8n-io/n8n
- https://github.com/fish-shell/fish-shell
- https://github.com/oilshell/oil/wiki/Alternative-Shells
- https://www.py4j.org/
- https://bellard.org/ts_server/ts_zip.html
- https://github.com/mbcoder/weather-station/blob/main/Raspberry%20Pi%20Setup.md
Tool to validate Avro Schemas Online! http://avro.tarantool.org/#
© 2020-2023 Tim Spann