All Data and AI Weekly #203: 18-Aug-2025

 

All Data and AI Weekly

( AI, Data, NiFi, Iceberg, Polaris, Streamlit, Flink, Kafka, Python, Java, SQL, Unstructured Data )

#203: 18-Aug-2025

image

https://bsky.app/profile/paasdev.bsky.social

NiFi + AI + AI Data Cloud + Iceberg. b

https://www.reddit.com/r/DataEngineeringForAI/hot/

image

Monthly NYC and Youtube Events

https://lu.ma/PINSAI

image

Code and Open Source Projects

AWS New York Summit https://github.com/tspannhw/conferences/tree/main/2025/awsny

Hex + Snowflake Hackathon https://github.com/tspannhw/hackathons/tree/main/2025-07-15

Apache NiFi + AI Agents + Cortex AI + Snowflake AISQL

https://github.com/tspannhw/TrafficAI/tree/main/Agents

https://github.com/tspannhw/transit-ridership

https://github.com/tspannhw/conferences

https://github.com/tspannhw/hackathons/tree/main/2025-07-15

Articles

OpenFlow https://www.youtube.com/watch?v=e4h3NZ2IqPM

Vision LM https://www.liquid.ai/blog/lfm2-vl-efficient-vision-language-models

ML Tasks and Graphs https://quickstarts.snowflake.com/guide/e2e-task-graph/index.html?index=..%2F..index#1

AI Snowflake Chat App https://www.youtube.com/watch?v=gjlNtBaaxNo

Weekly Client Newsletter: The Latest in Data & AI

Issue Date: August 15, 2025

Welcome to our weekly roundup of the latest news, tutorials, and resources in the world of data and AI. This week, we're diving into performance optimization, new open-source tools, and upcoming events you won't want to miss.

Spotlight on Tim Spann

Developer Advocate Tim Spann has been busy! Check out his latest projects and contributions:

  • SnowAdmin-Scripts: A collection of useful administration scripts for Snowflake. View on GitHub
  • SnowConvertAI-ExampleGreenplumDB-CursorAI: An example project showcasing AI-powered database conversion. View on GitHub
  • PINSAI-Events: Resources and code for AI-driven event processing. View on GitHub
  • Workspaces: A repository for collaborative development environments. View on GitHub
  • MTA-Snowpipe-Streaming: Examples and guides for streaming data with Snowpipe. View on GitHub

📚 Articles & Blog Posts

  • Optimizing Iceberg Table Performance: A strategic guide to enhancing performance with Apache Iceberg tables. Read on Medium
  • Building a Real-Time LiveOps Platform on Snowflake: A deep dive into creating a live operations platform. Read on Medium
  • AI for Customer Analytics: How to gain a competitive advantage with AI-driven customer insights. Read on Snowflake Blog
  • Monitoring Snowflake Usage: A guide to access, credit optimization, and monitoring. Read on Medium
  • Efficient AWS Calls from Snowflake: Using vectorized UDFs with session caching for better performance. Read on Medium
  • GPT-OSS: A Guide to Snowflake's Cross-Region LLM Inference: Explore the capabilities of cross-region large language model inference. Read on Dev.to
  • Ingesting osquery Into Apache Phoenix using Apache NiFi: A technical walkthrough for security data pipelines. Read on Cloudera Community
  • Introducing Open SWE: An open-source, asynchronous coding agent from LangChain. Read on LangChain Blog
  • Use AISQL to Understand Customer Feedback: Analyze product feedback with the power of AI SQL. Read on Medium
  • Collaborative Development in Snowflake: Learn how to set up workspaces for team collaboration. Read on Medium
  • Snowflake Cortex AISQL: An introduction to using Cortex AI with SQL. Read on Medium
  • Programmatic Access to Snowpark Container Services: Making containerized services even easier to access. Read on Medium
  • Keep the Apache Spark Code, Change the Engine to Snowflake: A guide to migrating Spark workloads. Read on Medium
  • Building a Threat Intelligence Agent with Snowflake and Streamlit: Create powerful cybersecurity tools. Read on Medium
  • Securing Remote MCP Servers: Best practices for server security. Read on Medium
  • Snowflake Agents for Cybersecurity: Exploring the use of agents in cybersecurity applications. Read on Medium
  • Data Monetization in the Age of Gen AI: Insights from McKinsey on scaling intelligence. Read on McKinsey
  • Unlocking Audio Insights with Snowflake Cortex AI: Analyze audio data with Cortex AI. Read on Medium
  • Snowflake's New Secondary Roles Behavior: Understand the latest changes to role-based access. Read on Medium
  • Programmatically Accessing Snowflake Model Inference Endpoints: A simplified approach to model inference. Read on Medium
  • The Easy Button for Context-Rich AI Agents: Simplify the creation of advanced AI agents. Read on Snowflake Blog
  • Snowflake Event-Driven Alerts: A step-by-step guide for data quality monitoring. Read on Medium
  • Snowflake Latest and Greatest Updates (Spring 2025): A summary of the newest features. Read on Medium

📂 Code Repositories & Open Source

  • SnowConvertAI Greenplum DB Example: An example project for converting Greenplum databases using AI. View on GitHub
  • Snowflake Labs ML Jobs Samples: Sample machine learning jobs for Snowflake. View on GitHub
  • Getting Started with Cortex AISQL: A guide and notebook for using Cortex AI with SQL. View on GitHub
  • Magentic UI: A UI for interacting with large language models. View on GitHub
  • Synchrotron: A tool for synchronizing data. View on GitHub
  • Nexus by Grafbase: A powerful tool for building GraphQL servers. View on GitHub
  • dkim-verify: A library for DKIM verification. View on GitHub
  • nifi-osquery: Apache NiFi processors for osquery. View on GitHub
  • cti-bench: A benchmark for cyber threat intelligence. View on GitHub
  • Markitdown: A tool for working with Markdown. View on GitHub
  • langextract: A tool for extracting language information. View on GitHub
  • Snowflake Summit 2025 Resources: Code and resources from the Snowflake Summit. View on GitHub
  • Daft: A distributed query engine for Python. View on GitHub
  • ArcticDB: A high-performance, serverless database for Python. Visit Website

📄 Documentation & Guides

  • SQLMesh Documentation: Comprehensive documentation for SQLMesh. Read the Docs
  • Getting Started with Snowflake Cluster Key Selection: A Quickstart guide to optimizing clustering. View Guide
  • Create AI Agents on Snowflake with Lang AI: A Quickstart guide for building AI agents. View Guide
  • Getting Started with Snowflake Intelligence: An introductory guide. View Guide
  • Getting Started with Snowflake Intelligence and CKE: A guide for using Snowflake Intelligence with CKE. View Guide
  • Snowflake New Features: The latest release notes and feature announcements. Read the Docs

🗓️ Upcoming Events & Webinars

  • Snowflake World Tour - Berlin: October 1, 2025. Register Here
  • Snowflake World Tour - Chicago: October 6, 2025. Register Here
  • Snowflake World Tour - Paris: October 7, 2025. Register Here
  • Snowflake World Tour - London: October 9, 2025. Register Here
  • Data & AI Leadership Forum: Join leaders in data and AI. Learn More
  • Snowflake Demo Webinars: Live demos for the Americas region. View Schedule
  • Build a Better Enterprise Lakehouse with Apache Iceberg: October 15, 2025. Register for Webinar
  • Fast Prototyping of GenAI Apps with Streamlit: A course from DeepLearning.AI. Enroll Now

Sign up for these upcoming virtual events to get hands-on experience and deep-dive into new features.

Sept 12 - Community over Code - Minneapolis - https://communityovercode.org/schedule/

Build Nov 4-6 - https://www.snowflake.com/en/build/

image

November 6, 2025 - NODES Conference - Virtual I will be speaking about Snowflake and Neo4J at their conference. https://neo4j.com/nodes-2025/

Spann_-_From_Air_Quality_to_Aircraft_Automobiles_Data_Is_Everywhere_906274

🎬 Videos

💡 Other Interesting Links

  • Alltext NYC - About: The mission and story behind Alltext. Learn More
  • AI4Science at Georgia Tech: Advancing science with AI. Visit Website
  • Snowflake Data Sharing Rebate Program: Details on the data sharing rebate program. Learn More

https://sessionize.com/tspann

https://github.com/timothyspann

© 2020-2025 Tim Spann https://www.youtube.com/@FLaNK-Stack