All Data and AI Weekly 218-01 Dec 2025
( AI, Data, NiFi, Iceberg, Polaris, Streamlit, Flink, Kafka, Python, Java, SQL, MCP, LLM, RAG, Cortex AI, AISQL, Search, Unstructured Data )
🚀 NiFi + AI + AI Data Cloud + Iceberg. 🚀
Philly, Princeton, NYC and Youtube Events
https://github.com/tspannhw/TrafficAI/tree/main/Agents
https://github.com/tspannhw/conferences
Focus: Cloud Architecture, Snowflake Data Cloud, & AI Agents
Reflecting on core principles that shape modern software engineering.
- The 12-Factor App Revisited
- Why 12-Factor Application Patterns Matter & Deep Dive into 12-Factor – A refresher on the methodology for building software-as-a-service apps, emphasizing declarative setups and cloud-native resilience.
- Legends of Big Data
- A Bootiful Podcast: Tim Spann – A conversation with Big Data legend Tim Spann on the evolution of the Spring community and streaming data.
Major moves in Data Governance, Cortex AI, and Pipeline Engineering.
- Models & Agents: Snowflake has expanded its AI capabilities significantly.
- Search & Optimization:
- Cortex Search: Boosts & Decays – Fine-tuning search relevance.
- Cortex AI-to-SQL Optimization
- Compliance: OneTrust Partnership for Data-Level Compliance and Trust Center Extensions.
- Pipelines:
- Unstructured Data Pipeline Setup (SQL)
- Real-time Dashboards with Apache Superset
- Snowflake Flow Diff – A tool for comparing flow definitions.
- Semantic View Terraform Provider
- Knowledge Sharing:
Emerging tools for building autonomous agents and open-source models.
- Agent Frameworks:
- Pydantic AI Agents – A framework for building production-grade agents.
- FinRobot – An open-source AI agent platform specifically for finance.
- Asterisk AI Voice Agent – Real-time voice interaction capabilities.
- Model Control Protocol (MCP):
- MCP-UI Organization – User interface components for MCP.
- New Models:
- Segment Anything 3 (SAM3) – The latest image segmentation model from Meta.
Utilities to boost productivity and handle data at the edge.
- CLI & Terminal:
- Gemini CLI Tips – Mastering Google's AI from the command line.
- Parquet Tools – Essential utility for inspecting parquet files.
- Cmux – A terminal multiplexer for managing multiple streams.
- Applied AI:
- Building Transit Ridership Analysis with Cursor AI – A practical guide to using AI editors for data projects.
- Hardware/IoT:
- Meshtastic Devices – Open source, off-grid, decentralized mesh networking.
- Virtual Lab: Building Your First Multimodal Document Pipeline (Dec 4, 2025)
https://github.com/timothyspann
© 2020-2025 Tim Spann https://www.youtube.com/@FLaNK-Stack
