FLaNK for 15 Jan 2024
15-January-2024
Happy Martin Luther King Jr. Day!
FLaNK Stack Weekly
Tim Spann @PaaSDev
https://www.youtube.com/@FLaNK-Stack
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
Get your new Apache NiFi for Dummies!
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
**This is Issue #120 **
https://github.com/tspannhw/FLiPStackWeekly
https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
https://www.cloudera.com/solutions/dim-developer.html
Articles
Using NiFi to Augment and Enrich LLM Results with Real-Time Contextual Data https://medium.com/@tspann/augmenting-and-enriching-llm-with-real-time-context-b6da7ba4960a
New Release of Cloudera Data Flow https://community.cloudera.com/t5/What-s-New-Cloudera/Cloudera-DataFlow-adds-Change-Data-Capture-processors-flow/ba-p/381727
10 Tips to Turbo Charge Streaming Stuff https://medium.com/@tspann/ten-tips-to-turbo-charge-your-streaming-b4749465ad48
A Cheat Sheet for RAG https://blog.llamaindex.ai/a-cheat-sheet-and-some-recipes-for-building-advanced-rag-803a9d94c41b
2023 Spring Projects https://spring.io/blog/2023/12/26/this-year-in-spring-2023/
K8 for Data Engineers https://dataengineeringcentral.substack.com/p/kubernetes-for-data-engineers
Ben Evans Slides for the Year https://www.ben-evans.com/presentations
Vector Database https://www.datanami.com/2024/01/04/how-real-time-vector-search-can-be-a-game-changer-across-industries/
Four Ways to Make a Docker Image https://typeshare.co/ciberkleid/posts/four-ways-from-java-code-to-docker-image
NiFi Python Dev https://nifi.apache.org/documentation/nifi-2.0.0-M1/html/python-developer-guide.html#requirements
GenAI Rag for NiFi https://datavolo.io/2024/01/why-genai-rag-is-homecoming-for-nifi/
NSA with NiFi https://media.defense.gov/2021/Aug/11/2002828754/-1/-1/0/TTP-SUCCESS-NIFI.PDF
Flink Session Improvements https://www.infoq.com/news/2024/01/doordash-flink-sessionization/
Videos
Looking at the New Features of Apache NiFi (Halifax Community over Code) https://www.youtube.com/watch?v=_orD9aAXk48&ab_channel=TheASF
Utilizing Real-Time Transit Data for Travel Optimization (Halifax Community over Code) Sunday Oct 8 2023, Canada https://www.youtube.com/watch?v=OWQmeF-UeEc&ab_channel=TheASF
Continuous SQL with Kafka and Flink | Timothy Spann (EN) https://www.youtube.com/watch?v=IGs0k240zhU&ab_channel=JAVAPRO
Events
Open Source Finance Forum. Virtual. https://resources.finos.org/znglist/osff-2023-virtual-presentations/?c=cG9zdDo5OTEzOTk%3D&utm_campaign=OSFF+NYC+2023&utm_content=269713979&utm_medium=social&utm_source=linkedin&hss_channel=lcp-18473937
April 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
Code
- https://github.com/tspannhw/FLaNK-CDW
- https://github.com/tspannhw/FLaNK-VectorDB
- https://github.com/tspannhw/FLaNK-RPI5
- https://github.com/tspannhw/FLaNK-EdgeAI
Tools
- https://docs.feast.dev/
- https://github.com/aeon-toolkit/aeon
- https://github.com/ofek/pyapp
- https://github.com/ml-explore/mlx
- https://github.com/vapoursynth/vapoursynth
- https://github.com/Unstructured-IO/unstructured
- https://github.com/m-bain/whisperX
- https://3dvisionlabs.com/2020/05/22/jetson-xavier-nx-compared-to-jetson-tx2-and-jetson-nano/
- https://hakibenita.com/fast-excel-python
- https://github.com/gorilla-llm/gorilla-cli
- https://github.com/simonw/llm
- https://github.com/1rgs/MeGPT
- https://trelliscope.org/
- https://github.com/deepflowio/deepflow
- https://wandb.ai/ml-colabs/fconn-yolo-nas/reports/Tackling-Water-Pollution-using-YOLO-NAS-and-WandB--Vmlldzo2MDEzMzk1/understanding-gpu-memory-2
- https://github.com/kyegomez/MultiModalMamba
- https://github.com/kevinbtalbert/Electric_and_Utilities_System_Demo
- https://github.com/janhq/jan
- https://github.com/graphql/graphiql
- https://github.com/DataSQRL/apiRAG
- https://github.com/DataSQRL/apiRAG/tree/main/api-examples/sensors
- https://unum-cloud.github.io/usearch/java/index.html
- https://marimo.io/
© 2020-2024 Tim Spann
FLaNK Weekly 08 Jan 2024
08-January-2024
This is the first issue of the year.
FLaNK Stack Weekly
Tim Spann @PaaSDev
https://www.youtube.com/@FLaNK-Stack
https://www.threads.net/@tspannhw
https://medium.com/@tspann/subscribe
Get your new Apache NiFi for Dummies!
https://www.cloudera.com/campaign/apache-nifi-for-dummies.html
https://ossinsight.io/analyze/tspannhw
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
**This is Issue #119 **
https://github.com/tspannhw/FLiPStackWeekly
https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
https://www.cloudera.com/solutions/dim-developer.html
Articles
10 Tips to Turbo Charge Streaming Stuff https://medium.com/@tspann/ten-tips-to-turbo-charge-your-streaming-b4749465ad48
Fraud Detection Pipeline Kafka, Pinecone https://redpanda.com/blog/fraud-detection-pipeline-redpanda-pinecone
Python Kafka Tutorial https://github.com/Aiven-Labs/python-apache-kafka-tutorial
LLMs with YoloPandas and Comet https://www.comet.com/site/blog/llms-exploring-data-with-yolopandas-and-comet/
Spring Boot with Virtual Threads (New Java) https://www.infoq.com/news/2023/12/spring-boot-virtual-threads/
Java Trends https://www.infoq.com/articles/java-trends-report-2023/?
Redis Gen AI http://antirez.com/news/140
OCR on a Mac https://blog.greg.technology/2024/01/02/how-do-you-ocr-on-a-mac.html
Faster AI Inference https://vgel.me/posts/faster-inference/#Continuous_Batching
Get Started with MQTT + NiFi Now! https://medium.com/cloudera-inc/getting-started-with-mqtt-in-apache-nifi-64e8cde1de91
Get Started with PyFlink Now! https://www.ververica.com/blog/all-you-need-to-know-about-pyflink
Try to be anonymous online https://www.wired.com/story/how-to-be-more-anonymous-online/
Kafka with Avro, Protobuf, JSON https://simon-aubury.medium.com/kafka-with-avro-vs-kafka-with-protobuf-vs-kafka-with-json-schema-667494cbb2af
Videos
https://www.youtube.com/watch?v=IGs0k240zhU&ab_channel=JAVAPRO
https://youtu.be/iw_F0nGanL0?si=ZMP-sdPvX8jL_ght
Events
Open Source Finance Forum. Virtual. https://resources.finos.org/znglist/osff-2023-virtual-presentations/?c=cG9zdDo5OTEzOTk%3D&utm_campaign=OSFF+NYC+2023&utm_content=269713979&utm_medium=social&utm_source=linkedin&hss_channel=lcp-18473937
April 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/
Cloudera Events https://www.cloudera.com/about/events.html
More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/
Code
- https://github.com/tspannhw/FLaNK-CDW
- https://github.com/tspannhw/FLaNK-EveryTransitSystem
- https://github.com/tspannhw/FLaNK-Ice
Models
Data
Tools
- https://github.com/cmang/durdraw
- https://github.com/gunnarmorling/1brc
- https://github.com/cmang/gifterm
- https://github.com/adaptive-scale/dbchaos
- https://github.com/oracle-samples/sd4j
- https://github.com/schedule-x/schedule-x
- https://github.com/ktock/container2wasm
- https://github.com/accessd/terminal-sunday
- https://starlight.astro.build/getting-started/
- https://github.com/basecamp/solid_queue
- https://tribuo.org/
- https://github.com/tensorflow/java
- https://onnxruntime.ai/getting-started
- https://github.com/tyxsspa/AnyText
- https://github.com/hrvach/deskhop
- https://github.com/sharkdp/hyperfine
- https://github.com/wasmerio/wasmer-java
- https://github.com/triton-inference-server/server/blob/main/docs/getting_started/quickstart.md
- https://podman-desktop.io/downloads
- https://github.com/ultralytics/ultralytics
- https://github.com/upscayl
- https://github.com/RizwanMunawar/yolov7-object-tracking
- https://steampipe.io/
- https://github.com/sroecker/LLM_AppDev-HandsOn
- https://readine.app/
- https://heynote.com/
- https://github.com/jasonjmcghee/rem
- https://www.catscloudsanddata.com/data
- https://github.com/mindsdb/mindsdb
- https://pikchr.org/home/doc/trunk/homepage.md
- https://github.com/MegaManSec/SSH-Snake
- https://github.com/GuntherRademacher/rr
- https://www.ibm.com/docs/en/tncm-p/1.4.4?topic=models-nifi-record-path-expression
- https://antonz.org/in-browser-code-playgrounds/
- https://github.com/dgarnitz/vectorflow
- https://nitro.jan.ai/
- https://github.com/tconbeer/harlequin
- https://harlequin.sh/
- https://github.com/YS-L/csvlens
- https://worldwind.arc.nasa.gov/web/get-started/#anchor
- https://github.com/MegaManSec/LDAP-Monitoring-Watchdog
- https://github.com/warmcat/libwebsockets
- https://github.com/saubury/kafka-serialization
- https://nino.app/
- https://www.dns.toys/
- https://github.com/pocketbase/pocketbase
- https://zserge.com/posts/ai-eliza/
© 2020-2024 Tim Spann