
Posts

Migrating from Apache Storm to Apache Flink

The first thing to do is not just pick up an existing topology and dump it into a new system, but to look at what can be reconfigured, refactored or reimagined.

For some routing, transformation or simple ingest style applications (or parts of a solution) you may want to use Apache NiFi. For others, Spark or Spark Streaming can quickly meet your needs. For simple Thing-to-Kafka or Kafka-to-Thing flows, Kafka Connect is appropriate. For things that need to run on individual devices, containers or pods, you may want to move a small application to NiFi Stateless. Sometimes a simple Kafka Streams application will meet your needs. For many use cases you can replace a compiled application with some solid Flink SQL code (see the sketch below).

For some discussions, check this out. For some really good information on how to migrate Storm solutions to Flink, Cloudera has a well documented solution for you: https://docs.cloudera.com/csa/1.2.0/stormflink-migration/to
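As a hedged illustration of that last point, here is a minimal sketch of what a small compiled filter-and-forward topology might shrink down to in Flink SQL. The topic names, columns and connector settings are hypothetical placeholders, not taken from the post.

```sql
-- Minimal Flink SQL sketch: replace a compiled filter-and-forward streaming app.
-- Topic names, columns and connector settings are hypothetical examples.
CREATE TABLE transit_events (
  event_id STRING,
  event_type STRING,
  severity INT,
  event_time TIMESTAMP(3),
  WATERMARK FOR event_time AS event_time - INTERVAL '5' SECOND
) WITH (
  'connector' = 'kafka',
  'topic' = 'transit-events',
  'properties.bootstrap.servers' = 'broker:9092',
  'scan.startup.mode' = 'latest-offset',
  'format' = 'json'
);

CREATE TABLE severe_events (
  event_id STRING,
  event_type STRING,
  severity INT,
  event_time TIMESTAMP(3)
) WITH (
  'connector' = 'kafka',
  'topic' = 'severe-transit-events',
  'properties.bootstrap.servers' = 'broker:9092',
  'format' = 'json'
);

-- The whole "application": continuously filter and forward events.
INSERT INTO severe_events
SELECT event_id, event_type, severity, event_time
FROM transit_events
WHERE severity >= 3;
```

The point is not this particular query, but that the source, sink and transformation all become declarative SQL instead of a compiled topology you have to build and deploy.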

Upcoming Demo Jam and The Latest Articles - 13 January 2020

Upcoming Demo Jam (Ask Live Questions of Pierre) - 21 Jan
https://www.cloudera.com/about/events/webinars/demo-jam-live-returns-build-data-flow-with-apache-nifi.html?utm_medium=tspann

Your Top 5 Apache NiFi Questions Answered By Experts
https://blog.cloudera.com/top-5-questions-about-apache-nifi/

Apache NiFi - The Data Movement Enabler in Hybrid Cloud
https://blog.cloudera.com/apache-nifi-the-data-movement-enabler-in-a-hybrid-cloud-environment/

Real-Time Transit Information Ingest
https://www.datainmotion.dev/2021/01/flank-real-time-transit-information-for.html

Using Kudu as a Cache for REST Streams
https://www.datainmotion.dev/2021/01/flank-using-apache-kudu-as-cache-for.html

Simple Change Data Capture with SQL
https://www.datainmotion.dev/2020/12/simple-change-data-capture-cdc-with-sql.html

Ingesting Websocket Data Live
https://www.datainmotion.dev/2020/12/ingesting-websocket-data-for-live-stock.html

Smart Stock Processing w

FLaNK: Real-Time Transit Information For NY/NJ/CT (TRANSCOM)

SOURCE: XML/RSS REST endpoint
FREQUENCY: Every minute
DESTINATIONS: HDFS, Kudu/Impala, Cloud, Kafka

The main source of these real-time transit updates for New Jersey, New York and Connecticut is TRANSCOM. I will read from this data source every minute to learn about real-time traffic events that are occurring on the roads and transportation systems near me. We will be reading the feed, which is in XML/RSS format, and parsing out the hundreds of events that come with each minute's update.

I want to store the raw XML/RSS file in S3/ADLS2/HDFS or GCS; that's an easy step. I will also parse and enhance this data for easier querying and tracking, adding a unique ID and a timestamp to every event as the data streams by. I will store my data in Impala/Kudu for fast queries and upserts (a hedged table sketch follows below). I can then build some graphs, charts and tables with Apache Hue and Cloudera Visual Applications. I will also publish my data
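A minimal sketch of the kind of Kudu-backed Impala table a flow like this could land in, plus an example upsert. The table name, columns and partitioning are hypothetical, chosen only to illustrate the unique ID plus timestamp pattern described above.

```sql
-- Hypothetical Impala DDL for a Kudu table keyed on the generated unique ID.
CREATE TABLE transcom_events (
  uuid STRING,
  event_ts BIGINT,          -- ingest timestamp added as the data streams by
  event_type STRING,
  latitude DOUBLE,
  longitude DOUBLE,
  description STRING,
  PRIMARY KEY (uuid)
)
PARTITION BY HASH (uuid) PARTITIONS 4
STORED AS KUDU;

-- Kudu supports UPSERT, so replaying the same event ID stays idempotent.
UPSERT INTO transcom_events
VALUES ('example-uuid-0001', 1610000000000, 'accident', 40.73, -74.17, 'Lane closed on I-95 NB');
```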

FLaNK: Using Apache Kudu As a Cache For FDA Updates

InvokeHTTP: Invoke the RSS feed to get our data.
QueryRecord: Convert RSS to JSON.
SplitJson: Split one file into individual records. (Should refactor to ForkRecord.)
EvaluateJSONPath: Extract the attributes you need.
ProcessGroup: SPL processing.

We call and check the RSS feed frequently, parse the records out of the RSS (XML) feed and check them against our cache. We use Cloudera's Real-Time Datahub with Apache Impala/Apache Kudu as our cache to see if we have already received a record for that SPL. If it hasn't arrived yet, it is added to the Kudu cache and processed.

SPL Processing

We use the extracted SPL to grab a detailed record for that SPL using InvokeHTTP. We have created an Impala/Kudu table to cache this information with Apache Hue. We use LookupRecord to read from our cache; if we don't have that value yet, we send it to the table for an UPSERT (a hedged sketch of the cache table and lookup is below). We send our raw data as XML/RSS to HDFS fo
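A hedged sketch of what the Kudu cache table and the check behind a LookupRecord-style step might look like. The table name spl_cache and its columns are hypothetical; the post does not give the actual schema.

```sql
-- Hypothetical cache table: one row per SPL set id we have already processed.
CREATE TABLE spl_cache (
  set_id STRING,
  first_seen_ts BIGINT,
  title STRING,
  PRIMARY KEY (set_id)
)
PARTITION BY HASH (set_id) PARTITIONS 2
STORED AS KUDU;

-- The cache check a LookupRecord-style step effectively performs:
-- a hit means we have seen this SPL before and can skip reprocessing it.
SELECT set_id FROM spl_cache WHERE set_id = 'example-set-id';

-- On a miss, record it so the next poll treats it as already processed.
UPSERT INTO spl_cache VALUES ('example-set-id', 1610000000000, 'Example drug label update');
```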

My Year in Review (2020)

Last year, I thought a few things would be coming in 2020.

What's Coming in 2020
Cloud Enterprise Data Platforms
Hybrid Cloud
Streaming with Flink, Kafka, NiFi
AI at the Edge with Microcontrollers and Small Devices
Voice Data In Queries
Event Handler as a Service (Automatic Kafka Message Reading)
More Powerful Parameter Based Modular Streaming
Cloud First For Big Data
Log Handling Moves to MiNiFi
Full AI At The Edge with Deployable Models
More Powerful Edge TPU/GPU/VPU
Kafka is everywhere
Open Source UI Driven Event Engines
FLaNK Stack gains popularity
FLINK Everywhere

Some of this was deferred due to the global pandemic, and I had a big miss with Voice Data, with some advances in streaming getting delayed. Here's a bit of new news for 2020: SRM was added to the Kafka DataHub in the Public Cloud: https://docs.cloudera.com/runtime/7.2.6/srm-overview/topics/srm-replication-overview.html I also just did a best-of-2020 video you can check out: Articles

Simple Change Data Capture (CDC) with SQL Selects via Apache NiFi (FLaNK)

Sometimes you need real CDC: you have access to transaction change logs, you use a tool like QLIK REPLICATE or GoldenGate to pump records out to Kafka, and then Flink SQL or NiFi can read and process them. Other times you need something easier, for just basic changes and inserts to some tables you are interested in receiving new data from as events. Apache NiFi can do this easily for you with QueryDatabaseTableRecord: you don't need to know anything but the database connection information, the table name and which field may change. NiFi will query, watch state and give you only the new records (a hedged sketch of the underlying query pattern is below). Nothing is hardcoded; parameterize those values and you have a generic Any-RDBMS-to-Any-Other-Store data pipeline. We are reading as records, which means each FlowFile in NiFi can carry thousands of records for which we know all the fields, types and schema-related information. This can be ones that NiFi infers the s
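A hedged sketch of the incremental query pattern that QueryDatabaseTableRecord effectively runs for you. NiFi builds and tracks this internally; the table name orders and the maximum-value column updated_at here are hypothetical examples, not from the post.

```sql
-- Hypothetical source table with a column NiFi can use as its "maximum value" column.
CREATE TABLE orders (
  order_id   BIGINT PRIMARY KEY,
  customer   VARCHAR(100),
  amount     DECIMAL(10,2),
  updated_at TIMESTAMP
);

-- First run: no state yet, so everything is fetched and max(updated_at) is remembered.
SELECT order_id, customer, amount, updated_at
FROM orders;

-- Subsequent runs: only rows newer than the stored state come back,
-- which is what turns plain SELECTs into simple change capture.
SELECT order_id, customer, amount, updated_at
FROM orders
WHERE updated_at > '2020-12-01 00:00:00'
ORDER BY updated_at;
```

The only thing you supply is the connection, the table and the column to watch; the incremental WHERE clause and the stored high-water mark are handled for you.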