Showing posts with label unstructured data processing. Show all posts
Showing posts with label unstructured data processing. Show all posts

AIM Weekly for 17 June 2024

 

17-June-2024

Tim Spann @PaaSDev

Milvus - Towhee - Attu - Feder - GPTCache - VectorDB Bench

Happy Father's Day To All! Also Happy Flag Day to Those in the United States.

image

AIM Weekly

Towhee - Attu - Milvus (Tim-Tam)

FLaNK - FLiPN

With a name like that I am not sure how I don't add that to my group.

SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search https://proceedings.neurips.cc/paper/2021/hash/299dc35e747eb77177d9cea10a802da2-Abstract.html

Congrats to Milvus https://www.dbta.com/Editorial/Trends-and-Applications/DBTA-100-2024-The-Companies-That-Matter-Most-in-Data-164289.aspx

https://github.com/milvus-io/milvus

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.youtube.com/@FLaNK-Stack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

https://ossinsight.io/analyze/tspannhw

CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

https://www.meetup.com/unstructured-data-meetup-new-york/

This is Issue #142

New Releases

Milvus Release 2.4.4

https://milvus.io/docs/release_notes.md

Release date: May 31, 2024

It includes a critical bug fix, so if you use bulk insert definitely upgrade now. Also some compilation updates for other platforms.

https://github.com/milvus-io/milvus/releases/tag/v2.4.4

Hardware

https://www.seeedstudio.com/BeagleYr-AI-beagleboard-orgr-4-TOPS-AI-Acceleration-powered-by-TI-AM67A.html

Upcoming

Summary of the Last Awesome Meetup https://www.linkedin.com/feed/update/urn:li:activity:7202803256891248640/

Cool Stuff

TIMM (Pytorch Image Models) https://timm.fast.ai/

Ben has gone deep on this article on the Future of Vector Search linking to some of Milvus & Zilliz' scalability and horizontal scalability deep description and highlighting a lot of interesting things going on.

https://gradientflow.substack.com/p/the-future-of-vector-search

See: https://zilliz.com/learn/scaling-vector-databases-to-meet-enterprise-demands?utm_source=tim

Articles

There's a lot of cool stuff with Milvus and new models, techniques, libraries and use cases.

https://medium.com/@tspann/not-every-field-is-just-text-numbers-or-vectors-976231e90e4d

https://medium.com/@tspann/unstructured-street-data-in-new-york-8d3cde0a1e5b

https://medium.com/@tspann/tech-week-soft-meetup-debut-june-2024-fc4cdf79342d

https://medium.com/@tspann/shining-some-light-on-the-new-milvus-lite-5a0565eb5dd9

https://medium.com/@zilliz_learn/using-vector-search-to-better-understand-computer-vision-data-08e137df9c6c

https://www.infoq.com/presentations/ai-monopoly/

https://python.plainenglish.io/claude-3-the-king-of-data-extraction-f06ad161aabf

https://towardsdatascience.com/rag-vs-finetuning-which-is-the-best-tool-to-boost-your-llm-application-94654b1eaba7

https://medium.com/@zilliz_learn/using-vector-search-to-better-understand-computer-vision-data-08e137df9c6c

https://zilliz.com/blog/exploring-multimodal-embeddings-with-fiftyone-and-milvus

https://medium.com/aiguys/yolov9-new-object-detection-king-6fc97b93dc9a

https://www.linkedin.com/pulse/nifi-retrieval-augmented-generation-chris-gambino-gfsec/?trackingId=OliHLynzQEKic3SgAznO5w%3D%3D

https://ml.dssconf.pl/

https://medium.com/@zilliz_learn/milvus-reference-architectures-e30a27c9f3c2

https://docs.openlit.io/latest/integrations/milvus

https://builtin.com/articles/real-time-data-ai

https://medium.com/@zilliz_learn/image-embeddings-for-enhanced-image-search-an-in-depth-explainer-6831859bedf0

https://medium.com/@zilliz_learn/semantic-search-with-milvus-and-openai-32573de80307

https://medium.com/@zilliz_learn/how-to-detect-and-correct-logical-fallacies-from-genai-models-3e4a9852d2ef

https://gradientflow.substack.com/p/the-future-of-vector-search

https://medium.com/vector-database/introducing-pymilvus-integration-with-embedding-models-a82f10d516ea

https://medium.com/@zilliz_learn/local-agentic-rag-with-langgraph-and-llama-3-6c962979821f

https://blogs.nvidia.com/blog/nemotron-4-synthetic-data-generation-llm-training/?utm_source=tim

https://medium.com/walmartglobaltech/reliably-processing-trillions-of-kafka-messages-per-day-23494f553ef9

https://jack-vanlightly.com/blog/2024/6/10/a-cost-analysis-of-replication-vs-s3-express-one-zone-in-transactional-data-systems?utm_source=tim

DSPy - Getting interesting https://medium.com/@sandyeep70/the-decline-of-traditional-prompt-engineering-and-the-rise-of-dspy-b27b9a5adc45

Why How + Milvus Lite https://medium.com/enterprise-rag/kickstart-your-genai-applications-with-milvus-lite-and-whyhow-ais-open-source-rule-based-retrieval-70873c7576f1

Videos

Using JSON Fields with Milvus https://www.youtube.com/watch?v=HP5L3Hr6Mt8

Street Cams + Milvus https://medium.com/@tspann/unstructured-street-data-in-new-york-8d3cde0a1e5b

Conf42: ML: Emerging GenAI https://youtu.be/ktVVdJB306U?feature=shared

Generative AI with Milvus https://www.youtube.com/watch?v=IfWIzKsoHnA

SF Unstructured Meetup - 03 June 2024 https://www.youtube.com/watch?v=UobR3czXqSo&ab_channel=Zilliz

Fueling AI with Airbyte https://zilliz.com/event/fueling-ai-with-great-data-airbyte?utm_campaign=2024-06-13_webinar_Airbyte-fueling-ai-with-great-data_zilliz&utm_medium=tim

Milvus Webinar https://www.youtube.com/watch?v=IowBdkeKi_M

AI Generated Videos https://youtu.be/5tJDBSDrKLQ https://www.youtube.com/watch?v=YNh-WNFLe98

Voyage AI Embeddings and Rerankers for Search and RAG https://medium.com/@zilliz_learn/voyage-ai-embeddings-and-rerankers-for-search-and-rag-587d9bfff877

Evaluate RAG Apps https://medium.com/@zilliz_learn/how-to-evaluate-rag-applications-e2936c1275f9

Slides

https://www.slideshare.net/slideshow/generative-ai-on-enterprise-cloud-with-nifi-and-milvus/267678399

https://www.slideshare.net/slideshow/06-04-2024-nyc-tech-week-discussion-on-vector-databases-unstructured-data-and-ai/269523214

https://ml.dssconf.pl/user.html#!/lecture/DSSML24-041a/rate

https://www.slideshare.net/slideshow/dssml24_tspann_codelessgenerativeaipipelines/269634571

https://www.slideshare.net/slideshow/06-12-2024-budapestdataforum-buildingreal-timepipelineswithflank-aim/269645846

Events

June 18, 2024: Princeton Meetup https://www.meetup.com/applied-generative-artificial-intelligence-applications/events/301336510/ https://www.startupgrind.com/events/details/startup-grind-princeton-presents-genai-gathering/

June 20, 2024: AI Camp Meetup. NYC. https://www.meetup.com/unstructured-data-meetup-new-york/events/301383476/

Nov 5-7, 10-12, 2024: CloudX. Online/Santa Clara. https://www.developerweek.com/cloudx/

Nov 19, 2024: XtremePython. Online. https://xtremepython.dev/2024/

Code

Models

Tools

Cool

Let's do the Time Sync Again https://github.com/milvus-io/milvus/blob/master/docs/design_docs/20211215-milvus_timesync.md

© 2020-2024 Tim Spann https://www.youtube.com/@FLaNK-Stack 


🎥 Playlist:  Unstructured Data Meetup  https://zilliz.com/community/unstructured-data-meetup
🖥️ Website:  [(https://www.youtube.com/@MilvusVectorDatabase/videos
X Twitter -   / milvusio  https://x.com/milvusio
🔗 Linkedin:  / zilliz  https://www.linkedin.com/company/zilliz/
😺 GitHub: https://github.com/milvus-io/milvus
🦾 Invitation to join discord:   / discord https://discord.com/invite/FjCMmaJng6

FLaNK-AIM: 13 May 2024

 

13-May-2024

https://www.youtube.com/@FLaNK-Stack




FLaNK / KNIFe AI / FLaNK-AIM Weekly

Tim Spann @PaaSDev

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.youtube.com/@FLaNK-Stack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

**This is Issue #137 **

https://github.com/tspannhw/FLiPStackWeekly

https://www.cloudera.com/solutions/dim-developer.html

Articles

https://medium.com/@tspann/boston-wheres-my-bus-llm-streaming-to-the-rescue-586dfd019237

https://medium.com/@tspann/small-language-models-sml-for-the-win-ea0c6fee8061

https://medium.com/@tspann/maybe-four-smaller-open-llm-s-are-better-than-one-93f78fb69eb9

https://medium.com/@tspann/building-a-milvus-connector-for-nifi-34372cb3c7fa

https://medium.com/@tspann/searching-slack-from-apache-nifi-9ed562aa2397

https://milvus.io/blog/milvus-supports-apache-parquet-file-supports.md

https://cwiki.apache.org/confluence/display/NIFI/Release+Notes#ReleaseNotes-Version1.26.0

https://www.cloudera.com/about/news-and-blogs/press-releases/2024-05-07-cloudera-announces-data-in-motion-products-will-be-made-available-as-kubernetes-operators-for-red-hat-openshift.html

https://www.jetson-ai-lab.com/research.html#meeting-schedule

https://jack-vanlightly.com/blog/2024/5/7/learning-and-reviewing-system-internals-tactics-and-psychology

https://blog.min.io/the-future-of-ai-is-open-source/

https://dagshub.com/blog/common-pitfalls-to-avoid-when-using-vector-databases/

https://medium.com/@adam.bellemare/preventing-and-fixing-bad-data-in-event-streams-part-1-27bf2a99b48e

https://www.linkedin.com/pulse/milvus-community-newsletter-beginner-rag-guides-learn-ok3bc/

https://spring.io/blog/2024/05/09/spring-ai-structured-output

https://openai.com/index/instruction-following/

https://blog.miguelgrinberg.com/post/how-llms-work-explained-without-math

https://www.dell.com/en-uk/blog/the-rise-of-the-small-language-models-slms/

https://www.junaideffendi.com/p/netflix-data-tech-stack

Videos

Generative AI with Milvus https://www.youtube.com/watch?v=IfWIzKsoHnA

Four Models at Once https://youtu.be/xvNgsZyfo6A?si=zxwc9VcFc3o0vU3P

Slides

https://www.slideshare.net/slideshow/generative-ai-on-enterprise-cloud-with-nifi-and-milvus/267678399

Events

May 21, 2024: Gen AI and Beyond with NiFi 2.0. Virtual.

May 30, 2024: Conf42: Machine learning https://www.conf42.com/Machine_Learning_2024_Tim_Spann_enriching_generative_events

June 12, 2024: Budapest Data + ML Forum. Virtual. image https://budapestml.hu/2024/en/speakers/

image

June 20, 2024: AI Camp Meetup. NYC.

Sept 24, 2024: JConf.Dev. Dallas. https://2024.jconf.dev/session/598816

Nov 5-7, 10-12, 2024: CloudX. Online/Santa Clara. https://www.developerweek.com/cloudx/

Nov 19, 2024: XtremePython. Online. https://xtremepython.dev/2024/

tim_v2_1200_628python

Cloudera Events https://www.cloudera.com/about/events.html

https://www.cloudera.com/events/cloudera-now-cdp.html?internal_keyplay=ALL&internal_campaign=FY25-Q1-AMER-WS-Cloudera-Now-Events-Page-P06&cid=701Hr000000tW6qIAE&internal_link=p06

More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe

Code

Models

Tools

Cool Tool

New

https://ai.meta.com/blog/raft-llama-retrieval-augmented-generation-supervised-fine-tuning-microsoft/

© 2020-2024 Tim Spann https://www.youtube.com/@FLaNK-Stack FLaNK-AIM with LLAMA 3