FLaNK AI - 01 April 2024

01-April-2024

FLaNK / KNIFe AI Weekly

https://knifeai.blogspot.com/

Tim Spann @PaaSDev

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.youtube.com/@FLaNK-Stack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

COOL CHARITY by KIDS!

https://www.unveilx.org/

CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

**This is Issue #131**

https://github.com/tspannhw/FLiPStackWeekly

https://www.cloudera.com/solutions/dim-developer.html

New Releases

Apache Hive 4.0.0 https://hub.docker.com/r/apache/hive

Articles

Meetup Report https://medium.com/@tspann/march-2024-meetup-report-61e82b00cf57

Real-Time Irish Transit Analytics https://medium.com/@tspann/real-time-irish-transit-analytics-ea76164c9595

Adding Generative AI Results to SQL Streams https://medium.com/@tspann/adding-generative-ai-results-to-sql-streams-513e1fd2a6af

Image Processing with Custom Python and Apache NiFi 2.0 https://medium.com/@tspann/image-processing-with-custom-python-and-nifi-2-0-06eadc62c03c

Cloudera + GenAI + NVIDIA NIM Microservices https://menews247.com/cloudera-to-enhance-genai-with-nvidia-nim-microservices/

https://blog.cloudera.com/data-architecture-and-strategy-in-the-ai-era/

https://blog.cloudera.com/clouderas-rhel-volution-powering-the-cloud-with-red-hat/

https://developer.nvidia.com/blog/translate-your-enterprise-data-into-actionable-insights-with-nvidia-nemo-retriever/

https://drive.google.com/file/d/11lCJAB272ruBa7AAVwYxaN2E2xooWizG/view

https://jack-vanlightly.com/blog/2024/3/26/the-sisyphean-struggle-and-the-new-era-of-data-infrastructure

https://pypi.org/project/streaming-jupyter-integrations/

https://thenewstack.io/how-nvidia-gpu-acceleration-supercharged-milvus-vector-database/

NiFi 2.0 Python https://medium.com/@sudeep.singh99/a-beginners-guide-to-nifi-2-0-custom-python-processor-ac6d8c7bda7b

Make sure you are on the right macOS version for new Java https://blogs.oracle.com/java/post/java-on-macos-14-4

https://www.datanami.com/2024/03/22/zilliz-unveils-game-changing-features-for-vector-search

https://towardsdatascience.com/automated-detection-of-data-quality-issues-54a3cb283a91

https://mlops.community/7-methods-to-secure-llm-apps-from-prompt-injections-and-jailbreaks/?

https://www.startdataengineering.com/post/change-data-capture-using-debezium-kafka-and-pg/

https://medium.com/@hubert.dulay/stream-processing-vs-real-time-olap-vs-streaming-database-339c75ca6772

https://www.cloudera.com/about/news-and-blogs/press-releases/2024-03-28-global-survey-reveals-90-of-it-leaders-believe-that-unifying-the-data-lifecycle-on-a-single-platform-is-critical-for-analytics-and-ai.html

https://nvidianews.nvidia.com/news/nvidia-blackwell-platform-arrives-to-power-a-new-era-of-computing

https://netflixtechblog.com/bending-pause-times-to-your-will-with-generational-zgc-256629c9386b

https://www.uber.com/en-GB/blog/balancing-hdfs-datanodes-in-the-uber-datalake/

https://techcrunch.com/2024/03/31/why-aws-google-and-oracle-are-backing-the-valkey-redis-fork/

Videos

Meetup Talk NYC https://youtu.be/u8XNNEPEnKQ?si=VWe6n8OKOF7qk6Fl

Irish Rail Preview https://youtu.be/EIpH7RPO2Yo

TCF Pro 2024 https://www.youtube.com/watch?v=tLbdrOxg5Rs

Streaming Traffic Cameras https://www.youtube.com/watch?v=85ECRGJBEQU&ab_channel=DatainMotion-HowToBeaStreamingEngineer

NiFi 101 https://www.youtube.com/watch?v=8cZJ9CyLYyI&t=3114s

March 11, 2024 Princeton 23 Orchard Event

https://www.slideshare.net/slideshows/2024-build-generative-ai-for-nonprofits/266748822

March 15, 2024 Trenton TCF

https://www.slideshare.net/slideshows/tcfpro24-building-realtime-generative-ai-pipelines/266807785

Events

April 2, 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/

April 8-11, 2024: NLIT Summit. Seattle. https://www.fbcinc.com/e/nlit/default.aspx

April 11, 2024: Conf42 LLM. Virtual. https://www.conf42.com/llms2024

April 12, 2024: AI Max Conference. 23 Orchard Princeton https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-hosts-ai-max-summit/

April 2024: AI Meetup NJ https://www.meetup.com/nj-gai/

April 24, 2024 (EMEA | APAC): 9:30 AM CEST | 1:00 PM IST. April 25, 2024 (AMER): 9:00 AM PDT | 12:00 PM EDT. Register now: http://spr.ly/6047Z3AjN

May 8-9, 2024: Data Summit 2024. Boston, MA. https://www.dbta.com/DataSummit/2024/default.aspx https://www.dbta.com/DataSummit/2024/Timothy-Spann.aspx

May 21, 2024: Gen AI and Beyond with NiFi 2.0. Virtual.

June 12, 2024: Budapest Data + ML Forum. Virtual. https://budapestdata.hu/2024/en/

Cloudera Events https://www.cloudera.com/about/events.html

https://www.cloudera.com/events/cloudera-now-cdp.html?internal_keyplay=ALL&internal_campaign=FY25-Q1-AMER-WS-Cloudera-Now-Events-Page-P06&cid=701Hr000000tW6qIAE&internal_link=p06

More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe

Code

Models

Tools

New

Vector DB built on ClickHouse https://github.com/myscale/myscaledb

Cool Tool - LLM Synthetic Data Generators

https://github.com/geraldyong/OpenAI_Synthetic/tree/main

https://github.com/quentinlintz/synthetic-data-generator

https://medium.com/@n-demia/how-to-prepare-test-data-via-openai-api-in-postman-7e378dde1f53

https://github.com/datadreamer-dev/DataDreamer

https://huggingface.co/collections/rbiswasfc/synthetic-data-generation-65ee68e821ddaff47073ed02

Flink Connectors (scroll down)

https://flink.apache.org/downloads/

Avro

Can't handle numbers bigger than 19 decimals
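
One plausible reading of the note above (an assumption, since the post gives no detail) is that Avro's `long` type is a 64-bit signed integer, whose largest value has 19 decimal digits. The sketch below uses plain Python, no Avro library, to check whether an integer fits in that range before you pick a schema type. The helper names `fits_avro_long` and `suggest_avro_type` are hypothetical and only for illustration.

```python
# Bounds of a signed 64-bit integer, which is how the Avro spec defines `long`.
INT64_MIN = -(2**63)
INT64_MAX = 2**63 - 1  # 9223372036854775807, which is 19 digits long


def fits_avro_long(value: int) -> bool:
    """Return True if `value` fits in a signed 64-bit integer."""
    return INT64_MIN <= value <= INT64_MAX


def suggest_avro_type(value: int) -> str:
    """Hypothetical helper: suggest a schema type for an integer value."""
    if fits_avro_long(value):
        return "long"
    # Assumption: larger values are carried as text, or as bytes annotated
    # with Avro's `decimal` logical type and an explicit precision.
    return "string, or bytes with the decimal logical type"


if __name__ == "__main__":
    for candidate in (42, INT64_MAX, 10**19):
        print(candidate, "->", suggest_avro_type(candidate))
```

Note that not every 19-digit number fits: anything above 9223372036854775807 overflows a `long`, so a range check is safer than counting digits.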

Throwback Article

https://community.cloudera.com/t5/Community-Articles/Ingesting-RDBMS-Data-As-New-Tables-Arrive-Automagically-into/ta-p/246214

https://docs.cloudera.com/csp-ce/latest/ce-overview/topics/csp-ce-overview.html

Discount

Discount access to DataSummit 2024 https://secure.infotoday.com/RegForms/DataSummit/?Priority=24SPKR

© 2020-2024 Tim Spann

FLaNK AI Weekly 18 March 2024

18-March-2024

FLaNK Stack Weekly

Tim Spann @PaaSDev

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.youtube.com/@FLaNK-Stack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

Congrats to my wife for being the youngest Leader of our local Elks!

CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

**This is Issue #129**

https://github.com/tspannhw/FLiPStackWeekly

https://www.cloudera.com/solutions/dim-developer.html

New Releases

https://cldr-steven-matison.github.io//blog/CEM-2.1.2-Release/

Articles

Image Processing with Custom Python and Apache NiFi 2.0 https://medium.com/@tspann/image-processing-with-custom-python-and-nifi-2-0-06eadc62c03c

Mixtral Deep Dive https://dzone.com/articles/mixtral-generative-sparse-mixture-of-experts-in-da

AI Augmented DevRel part 1 https://medium.com/@tspann/ai-augmented-devrel-part-1-4058af905a89

Next Level Flink with Nussknacker https://medium.com/@tspann/next-level-flink-with-nussknacker-fe7294e2ef21

Mixtral Generative Sparse Mixture of Experts in DataFlows https://medium.com/@tspann/mixtral-generative-sparse-mixture-of-experts-in-dataflows-59744f7d28a9

https://news.mit.edu/2024/researchers-enhance-peripheral-vision-ai-models-0308

https://medium.com/@1709deepesh/connecting-apache-nifi-to-microsoft-graph-reading-emails-with-invokehttp-processors-6d84db9fa157

https://community.cloudera.com/t5/Community-Articles/How-to-call-a-CML-Deployed-Model-From-Apache-NiFi-in-10/ta-p/374853

https://www.infoq.com/news/2024/03/java-22-so-far/

https://www.infoq.com/news/2024/03/lapce-rust-editor

https://www.infoq.com/news/2024/03/mistral-ai-aws/

https://www.infoq.com/news/2024/03/anthropic-claude-ai/

https://www.quantamagazine.org/new-breakthrough-brings-matrix-multiplication-closer-to-ideal-20240307/

https://venturebeat.com/ai/hugging-face-is-launching-an-open-source-robotics-project-led-by-former-tesla-scientist/?

https://dbos-project.github.io/

https://www.datanami.com/2024/03/07/cloudera-unveils-next-phase-of-open-data-lakehouse-to-unlock-enterprise-ai/?

https://www.decodable.co/blog/taxonomy-of-data-change-events

https://www.europarl.europa.eu/news/en/press-room/20240308IPR19015/artificial-intelligence-act-meps-adopt-landmark-law

https://medium.com/plain-simple-software/the-llm-app-stack-2024-eac28b9dc1e7

https://www.slideshare.net/JulienSIMON5/an-introduction-to-computer-vision-with-hugging-face

https://huggingface.co/learn/nlp-course/chapter1/2?fw=pt

https://huggingface.co/timm

https://github.com/huggingface/pytorch-image-models

https://www.infoq.com/news/2024/03/azure-openai-your-data-ga/

https://developers.redhat.com/articles/2024/03/13/kafka-tiered-storage-deep-dive?

https://community.cloudera.com/t5/What-s-New-Cloudera/Cloudera-DataFlow-adds-Change-Data-Capture-processors-flow/ba-p/381727

Videos

Streaming Traffic Cameras https://www.youtube.com/watch?v=85ECRGJBEQU&ab_channel=DatainMotion-HowToBeaStreamingEngineer

Python Processor https://www.youtube.com/watch?v=jF5FSY0xFiQ&t=9s&ab_channel=DatainMotion-HowToBeaStreamingEngineer

Preview of TCF Pro Talk https://youtu.be/ce9lhtbp48M?si=Svjb2-bIIPXLwXD1

Feb 22, 2024 NYC Meetup

https://www.slideshare.net/slideshows/2024-feb-ai-meetup-nyc-genaillmsmldata-codeless-generative-ai-pipelines/266444687

Feb 28, 2024 NYC Flink Meetup

https://www.slideshare.net/slideshows/2024-february-28-nyc-meetup-unlocking-financial-data-with-realtime-pipelines/266539528

Feb 29, 2024 Conf42 Python 2024

https://www.slideshare.net/slideshows/conf42python-using-apache-nifi-apache-kafka-risingwave-and-apache-iceberg-with-stock-data-and-llm/266521940

https://www.slideshare.net/slideshows/conf42pythonbuilding-apache-nifi-20-python-processors/266522007

https://www.youtube.com/watch?v=awxzG7laWx4&ab_channel=Conf42

https://www.youtube.com/watch?v=FD16_oZ65Ug&ab_channel=Conf42

March 11, 2024 Princeton 23 Orchard Event

https://www.slideshare.net/slideshows/2024-build-generative-ai-for-nonprofits/266748822

March 15, 2024 Trenton TCF

https://www.slideshare.net/slideshows/tcfpro24-building-realtime-generative-ai-pipelines/266807785

Events

March 27, 2024: Startup Grind. Jersey City https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-princeton-amp-nj-big-data-alliance-generative-ai-reverse-pitch/

March 28, 2024: Pinot + NiFi + Flink + Kafka Meetup NYC https://www.meetup.com/real-time-analytics-meetup-ny/events/299290822/

April 2, 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/

April 8-11, 2024: NLIT Summit. Seattle. https://www.fbcinc.com/e/nlit/default.aspx

April 11, 2024: Conf42 LLM. Virtual. https://www.conf42.com/llms2024

April 12, 2024: AI Max Conference. 23 Orchard Princeton https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-hosts-ai-max-summit/

April 2024: AI Meetup NJ https://www.meetup.com/nj-gai/

May 8-9, 2024: Data Summit 2024. Boston, MA. https://www.dbta.com/DataSummit/2024/default.aspx

Cloudera Events https://www.cloudera.com/about/events.html

More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe

Code

Models

Datasets

Tools

Tips

https://www.datainmotion.dev/2020/05/one-minute-nifi-tip-calcite-sql-notes.html

Cool Tool

These are amazing diagrams and graphics.
https://drawify.com/templates/341/personal-user-manual

© 2020-2024 Tim Spann