https://www.slideshare.net/slideshows/tcfpro24-building-realtime-generative-ai-pipelines/266807785
FLaNK AI for 11 March 2024
This week I am doing an AI meetup on Monday and a conference talk on Friday.
Tim Spann @PaaSDev
https://www.youtube.com/@FLaNK-Stack
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
Please join my meetup groups (NJ/NYC/Philly/Virtual).
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
**This is Issue #128**
https://github.com/tspannhw/FLiPStackWeekly
https://www.cloudera.com/solutions/dim-developer.html
https://docs.cloudera.com/cem/2.1.2/installation/topics/cem-install-cem-cm.html https://docs.cloudera.com/cem/2.1.2/release-notes/topics/cem-whats-new.html
NiFi Parameter Providers https://medium.com/@tspann/utilizing-apache-nifi-parameter-providers-36cf60313d5e
Mixtral Generative Sparse Mixture of Experts in DataFlows https://medium.com/@tspann/mixtral-generative-sparse-mixture-of-experts-in-dataflows-59744f7d28a9
Building an LLM Bot for Meetups and Conference Interactivity https://medium.com/@tspann/building-an-llm-bot-for-meetups-and-conference-interactivity-c211ea6e3b61
Kafka for Edge AI: Jetson Nano https://medium.com/@tspann/kafka-for-edge-ai-on-jetson-nano-enabling-efficient-data-streaming-c5bb01ca0705
Streaming Street Cams to YoLo v8 with Python and NiFi to MinIO (S3) https://medium.com/@tspann/streaming-street-cams-to-yolo-v8-with-python-and-nifi-to-minio-s3-3277e73723ce
Using OLLAMA with Mistral and Apache NiFi https://medium.com/@tspann/using-ollama-with-mistral-and-apache-nifi-720c17f5ff12
Using Google Gemma https://medium.com/@tspann/google-gemma-for-real-time-lightweight-open-llm-inference-88efe98e580f
https://medium.com/@tspann/open-source-vision-servers-pre-reqs-be2559e3ef52
https://readwrite.com/the-nsa-list-of-memory-safe-programming-languages-has-been-updated/
https://www.anthropic.com/news/claude-3-family
https://medium.com/@tspann/septa-transit-real-time-81082878b485
https://www.infosecurity-magazine.com/news/worm-created-generative-ai-systems/
https://cldr-steven-matison.github.io/blog/SSB-Iceberg-Time-Travel/
https://blog.devgenius.io/langchain-vs-llamaindex-vs-haystack-0d12d25b189e
https://github.com/milvus-io/milvus-haystack
https://towardsdatascience.com/deploying-llms-into-production-using-tensorrt-llm-ed36e620dac4
https://spectrum.ieee.org/prompt-engineering-is-dead
https://github.com/cldr-steven-matison/SSB-CDC-Demo
https://docs.vllm.ai/en/latest/getting_started/quickstart.html
https://thenewstack.io/why-large-language-models-wont-replace-human-coders/
https://thenewstack.io/the-rise-of-small-language-models/
https://verse.systems/blog/post/2024-03-09-using-llms-to-generate-fuzz-generators/
https://engineeringblog.yelp.com/2024/03/building-data-abstractions-with-streaming-at-yelp.html?u
https://blog.allegro.tech/2024/03/kafka-performance-analysis.html
https://medium.com/analytics-vidhya/postgresql-integration-with-jupyter-notebook-deb97579a38d
Streaming Traffic Cameras https://www.youtube.com/watch?v=85ECRGJBEQU&ab_channel=DatainMotion-HowToBeaStreamingEngineer
Joining Three Kafka Topics in Flink SQL https://youtu.be/NI2n7uQJiP0?si=0aAFrkhOdqzZKisw
Continuous SQL https://youtu.be/k1mANc88OJc?si=o--ysshxFPem4Cze
CDF https://youtu.be/Z1IZ7uK_76s?si=XjlmcTQhwQ8F8aD0
https://www.youtube.com/watch?v=awxzG7laWx4&ab_channel=Conf42
https://www.youtube.com/watch?v=FD16_oZ65Ug&ab_channel=Conf42
Encrypt a message until some date in the future. https://timelock.dev/
March 11, 2024: Princeton. Meetup. GenAI. https://www.meetup.com/applied-generative-artificial-intelligence-applications/ https://23orchard.com/ https://www.startupgrind.com/events/details/startup-grind-princeton-presents-ignite-change-build-generative-ai-for-non-profits/
March 15, 2024: TCF Pro. Princeton, NJ. IEEE Information Technology Professional Conference at the Trenton Computer Festival, Friday, March 15th, 2024. https://princetonacm.acm.org/tcfpro/
March 27, 2024: Startup Grind. Jersey City https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-princeton-amp-nj-big-data-alliance-generative-ai-reverse-pitch/
March 28, 2024: Pinot + NiFi + Flink + Kafka Meetup NYC https://www.meetup.com/real-time-analytics-meetup-ny/events/299290822/
April 2, 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/
April 8-11, 2024: NLIT Summit. Seattle. https://www.fbcinc.com/e/nlit/default.aspx
April 11, 2024: Conf42 LLM. Virtual. https://www.conf42.com/llms2024
April 2024: AI Meetup NJ https://www.meetup.com/nj-gai/
May 8-9, 2024: Data Summit 2024. Boston, MA. https://www.dbta.com/DataSummit/2024/default.aspx
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe
- https://github.com/tspannhw/FLaNK-python-processors
- https://github.com/tspannhw/FLaNK-IceIceData
- https://github.com/tspannhw/PaK-Stocks
- https://github.com/tspannhw/meetups/tree/main/28feb2024
- https://github.com/SuperEllipse/LLM-demo-on-CML
- https://github.com/salesforce/LAVIS
- https://github.com/lm-sys/FastChat
- https://huggingface.co/Salesforce/blip-image-captioning-large
- https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
- https://github.com/BatsResearch/bonito
- https://github.com/mini-sora/minisora
- https://github.com/voxel51/fiftyone
- https://github.com/kadjoudi/Fraud-Prevention-With-Cloudera-SSB
- https://cldr-steven-matison.github.io/blog/SSB-Dead-Letter-Queue/
- https://sql-workbench.com/
- https://github.com/anitagraser/movement-analysis-tools
- https://github.com/ni1o1/transbigdata
- https://github.com/xoolive/traffic
- https://ipyleaflet.readthedocs.io/en/latest/
- https://altair-viz.github.io/
- https://github.com/InsightLab/PyMove
- https://ollama.com/library/starcoder2
- https://github.com/ubicloud/ubicloud
- https://www.waitingforcode.com/apache-flink/apache-flink-input-data-reading/read
- https://tunnelbroker.net/
- https://github.com/basecamp/kamal
- https://fmcheatsheet.org/
- https://github.com/allenai/wimbd
- https://github.com/linkedin/openhouse
- https://github.com/Data-Provenance-Initiative/Data-Provenance-Collection
- https://www.dataprovenance.org/
- https://github.com/rom1504/clip-retrieval
- https://github.com/lmmlzn/awesome-llms-datasets
- https://github.com/NVIDIA/TensorRT-LLM
- https://pigsty.io/
- https://github.com/dalibo/pg_activity
- https://github.com/dalibo/temboard
- https://github.com/daytonaio/daytona
- https://github.com/abetlen/llama-cpp-python
- https://github.com/PKU-YuanGroup/Open-Sora-Plan
- https://github.com/gptscript-ai/gptscript
- https://huggingface.co/docs/transformers/en/model_doc/mixtral
- https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1
- https://www.microsoft.com/en-us/research/blog/autogen-enabling-next-generation-large-language-model-applications/
- https://github.com/cloudera/CML_AMP_Deploy-Mistral7B-CML-Native-Model
- https://hertzbeat.com/
- https://github.com/LibrePDF/OpenPDF
- https://github.com/HeyPuter/puter
- https://github.com/vllm-project/vllm
- https://github.com/SillyTavern/SillyTavern
- https://github.com/datastax/ragstack-ai
- https://github.com/KhoomeiK/LlamaGym
- https://github.com/gingerbeardman/mandala
- https://www.usebruno.com/
- https://openddl.org/
- https://blog.research.google/2024/03/croissant-metadata-format-for-ml-ready.html
- https://stackoverflow.blog/2024/02/07/best-practices-for-building-llms/
- https://developers.google.com/edu/python
- https://github.com/Chleba/netscanner
- https://github.com/run-llama/llama_parse
- https://www.llamaindex.ai/blog/introducing-llamacloud-and-llamaparse-af8cedf9006b
- https://babelfishpg.org/getstarted/
- https://github.com/jdubois/2024-LangChain4J-demo
- https://github.com/tufin/oasdiff
© 2020-2024 Tim Spann
FLaNK Stack 04 March 2024
**This is Issue #127**
https://github.com/tspannhw/FLiPStackWeekly
https://www.cloudera.com/solutions/dim-developer.html
Apache Kafka 3.7.0
https://kafka.apache.org/blog#apache_kafka_370_release_announcement
https://www.youtube.com/watch?v=mEsleV16qdo&ab_channel=freeCodeCamp.org
Yet another Python Processor https://medium.com/@tspann/yet-another-python-processor-45aaae6fe406
Streaming Street Cams to YoLo v8 with Python and NiFi to MinIO (S3) https://medium.com/@tspann/streaming-street-cams-to-yolo-v8-with-python-and-nifi-to-minio-s3-3277e73723ce
Meetup Report 28 Feb 2024 https://medium.com/@tspann/report-28-feb-2024-building-realtime-ai-applications-with-apache-flink-76edb957b996
Using OLLAMA with Mistral and Apache NiFi https://medium.com/@tspann/using-ollama-with-mistral-and-apache-nifi-720c17f5ff12
Python to Apache Iceberg https://medium.com/@tspann/python-to-apache-iceberg-s-5d642e1170ae https://www.youtube.com/watch?v=pRTNQ2Ddu88
Using Google Gemma https://medium.com/@tspann/google-gemma-for-real-time-lightweight-open-llm-inference-88efe98e580f
NYC Traffic?? (NiFi, Kafka, Flink) https://medium.com/@tspann/nyc-traffic-are-you-kidding-me-6d3fa853903b
Subways and Transit Updates in Real-Time https://medium.com/@tspann/subways-and-transit-updates-in-real-time-30c104c359ef
Open Source Data Infrastructure Meetup - Feb 2024 https://medium.com/@tspann/open-source-data-infrastructure-meetup-feb-2024-9e8048666828
https://towardsdatascience.com/all-public-transport-leads-to-utrecht-not-rome-bb9674600e81
https://datavolo.io/2024/02/collecting-logs-with-apache-nifi-and-opentelemetry/
https://zilliz.com/learn/milvus-vector-database-quickstart
https://echarts.apache.org/handbook/en/get-started/
https://www.decodable.co/blog/flink-sql-and-the-joy-of-jars?
https://www.philschmid.de/dpo-align-llms-in-2024-with-trl?
https://www.infoq.com/articles/architecting-java-persistence-patterns-and-strategies/
https://gonzoml.substack.com/p/big-post-about-big-context
https://ben11kehoe.medium.com/the-end-of-programming-will-look-a-lot-like-programming-8b877c8efef8
https://apiiro.com/blog/malicious-code-campaign-github-repo-confusion-attack/
https://vickiboykis.com/2024/02/28/gguf-the-long-way-around/
https://thenewstack.io/the-new-monitoring-for-services-that-feed-from-llms/?
https://nagarajtantri.medium.com/chaining-multiple-http-apis-via-apache-nifi-72c4d14c072d
Streaming Traffic Cameras https://www.youtube.com/watch?v=85ECRGJBEQU&ab_channel=DatainMotion-HowToBeaStreamingEngineer
Joining Three Kafka Topics in Flink SQL https://youtu.be/NI2n7uQJiP0?si=0aAFrkhOdqzZKisw
Continuous SQL with Kafka and Flink https://www.youtube.com/watch?v=0Fb8ggZlPrQ&ab_channel=stevecantrell
Building Real-time Pipelines: A Case Study by Transit Data https://www.youtube.com/watch?v=VjmC4J7KZgw&t=2s&ab_channel=Aiven
https://www.youtube.com/watch?v=29JnbO6LL6g
https://www.youtube.com/watch?v=0cdGwP3Shxs
https://www.youtube.com/watch?v=H7uUDLo_XI0
https://www.youtube.com/watch?v=awxzG7laWx4&ab_channel=Conf42
https://www.youtube.com/watch?v=FD16_oZ65Ug&ab_channel=Conf42
March 11, 2024: Princeton. Meetup. GenAI. https://www.meetup.com/applied-generative-artificial-intelligence-applications/ https://23orchard.com/
March 15, 2024: TCF Pro. Princeton, NJ. IEEE Information Technology Professional Conference at the Trenton Computer Festival, Friday, March 15th, 2024. https://princetonacm.acm.org/tcfpro/
March 27, 2024: Startup Grind. Jersey City https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-princeton-amp-nj-big-data-alliance-generative-ai-reverse-pitch/
March 28, 2024: Pinot + NiFi + Flink + Kafka Meetup NYC https://www.meetup.com/real-time-analytics-meetup-ny/events/299290822/
April 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/
April 8-11, 2024: NLIT Summit. Seattle. https://www.fbcinc.com/e/nlit/default.aspx
April 11, 2024: Conf42 LLM. Virtual. https://www.conf42.com/llms2024
April 2024: AI Meetup NJ https://www.meetup.com/nj-gai/
May 8-9, 2024: Data Summit 2024. Boston, MA. https://www.dbta.com/DataSummit/2024/default.aspx
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe
- https://github.com/tspannhw/FLaNK-python-processors
- https://github.com/tspannhw/FLaNK-IceIceData
- https://github.com/tspannhw/PaK-Stocks
- https://github.com/tspannhw/meetups/tree/main/28feb2024
- https://github.com/SuperEllipse/LLM-demo-on-CML
- https://github.com/salesforce/LAVIS
- https://github.com/lm-sys/FastChat
- https://github.com/salesforce/LAVIS/blob/main/examples/blip_image_captioning.ipynb
- https://llm.mlc.ai/docs/
- https://docs.stackable.tech/home/stable/demos/data-lakehouse-iceberg-trino-spark.html
- https://traefik.me/
- https://hub.docker.com/r/gooddata/gooddata-cn-ce
- https://medium.com/plain-simple-software/software-engineer-to-devrel-a-guide-13d4bee97631
- https://huggingface.co/spaces/etri-vilab/KOALA
- https://github.com/Azure/PyRIT
- https://github.com/youngwanLEE/sdxl-koala
- https://shop.sb-components.co.uk/collections/raspberry-pi-pico/products/ardipi-uno-r3-alternative-board-based-on-pico-w
- https://aihub.qualcomm.com/models
- https://developer.nvidia.com/blog/build-an-llm-powered-api-agent-for-task-execution/
- https://github.com/NVIDIA/GenerativeAIExamples?nvid=nv-int-tblg-585510
- https://github.com/explodinggradients/ragas
- https://github.com/eladlev/autoprompt
- https://arxiv.org/abs/2402.17764
- https://github.com/ryogesh/llm-rag-graph
- https://github.com/pgvector/pgvector
- https://chartfox.org/
- https://github.com/AlmasB/FXGL
- https://github.com/bruin-data/ingestr
- https://github.com/baverman/sqlbind
- https://pgplayground.com/
- https://github.com/ytang07/ai_agents_cookbooks
- https://github.com/yerfor/GeneFacePlusPlus
- https://github.com/yerfor/Real3DPortrait
- https://pygments.org/demo/
- https://github.com/vectara/react-search
- https://github.com/deptofdefense/AndroidTacticalAssaultKit-CIV
- https://gist.github.com/loreanvictor/bddd8824c744024d338e935bd7e96707
- https://github.com/dotenv-org/dotenv-vault
- https://pql.dev/
- https://sql-workbench.com/
- https://slidesynth.com/
- https://github.com/slingdata-io/sling-cli