Brainberg

Data & Analytics Events in Europe

The modern data stack has a vocal European community. This page collects events for the engineers, analysts, and analytics engineers doing the work: Databricks and Snowflake user groups, Microsoft Fabric and Power BI community meetups (a particularly active series across the continent), dbt meetups, Airflow and dagster practitioner events, PyData conferences, data-engineering guild nights, PostgreSQL and PGConf events, and the growing list of "we ship a data product" meetups at scaleups across the continent.

Subjects trend practical: cost control on cloud warehouses, incremental models, lineage and testing, moving from batch to streaming (Kafka, Flink, ClickHouse), data contracts, semantic layers, lakehouse architectures, and the politics of data teams inside larger engineering orgs. BI and product-analytics events sit here too: Power BI community groups, Tableau, Hex, Amplitude, PostHog, Mixpanel communities, plus web-analytics gatherings like MeasureCamp. Anchor conferences include Big Data Minds, Data Innovation Summit, Business Intelligence Summit, data2day, and Berlin Buzzwords.

Berlin and Amsterdam are probably the densest hubs in Europe for this category, with Berlin leaning toward data engineering and Amsterdam toward analytics engineering. London, Dublin, Paris, Stockholm, Copenhagen, and the Baltics all sustain active groups as well. Brainberg consolidates these into a single feed, so you don't have to chase ten community mailing lists to find the next evening meetup in your city.

Upcoming events

Data & AnalyticsMeetupFreeOnline

(Online) Apache Airflow: DAG Design, Monitoring, Lessons from the Field

This is an Online event, the Teams link will be published on the right of this page for those who have registered.
18:30: Apache Airflow in Production: DAG Design, Monitoring, and Lessons from the Field - Pradeep Kalluri
19:55 Prize Draw - Packt eBooks

Session details:
Apache Airflow is the world's most widely deployed workflow orchestration platform — but running it reliably in production requires more than just writing DAGs.

In this session, Pradeep Kalluri, a Data Engineer at NatWest Bank and Apache Airflow open-source contributor, shares practical lessons from building and operating Airflow at scale in cloud environments.

Topics covered:
- DAG design patterns that scale — avoiding common pitfalls in dependency management and task granularity
- Production monitoring and alerting — what to measure, how to set up SLA alerts, and how to debug failures fast
- Celery and Kubernetes executor deep-dives — choosing the right executor for your workload
- Lessons from open-source contribution — what contributing to Airflow taught me about how it really works under the hood
- Cloud deployment best practices — running Airflow on AWS and Azure with high availability

Whether you're just getting started with Airflow or managing complex multi-team pipelines, this session will give you actionable patterns you can apply immediately.

Speaker:
Pradeep Kalluri
Data Engineer | NatWest | Building Scalable Data Platforms
Data Engineer with 3+ years of experience building production data platforms at NatWest, Accenture, and Capgemini. Specialized in cloud-native architectures, real-time processing with Kafka and Spark, and data quality frameworks. Published technical writer on Medium, sharing practical lessons from production systems. Passionate about making data platforms reliable and trustworthy.

  • linkedin.com/in/pradeepkalluri
  • medium.com/@kallurip... (blog)
  • kalluripradeep.github.io (company)
Wed 6 May · 17:30< 50
Data & AnalyticsMeetupFreeOnline

Why Streamlit is the Missing Piece in Your Analytics Stack

Glasgow, 🇬🇧 United Kingdom

Ever found yourself wrestling with Power BI limitations or implementing complex workarounds for what should be simple user requests? Enter Streamlit – the open source Python package that's transforming how we deliver value from data.

In this session, you'll discover how Streamlit enables you to build beautiful, interactive data applications in minutes, not months, all without front-end development expertise. We'll showcase real-world applications we've built for clients that let users manipulate model parameters and instantly visualise impacts, delivering a level of engagement traditional BI tools simply can't match.

By the end of this talk, you'll have everything you need to create your first Streamlit app, along with practical guidance for taking your prototypes from desktop to Azure deployment.

See first hand how this transformative technology is helping organisations move users from passive dashboard consumers to active participants in the data experience.

🎯 Target Audience

  • Data analysts and BI developers frustrated with traditional tools
  • Python developers and data scientists building interactive apps
  • Consultants delivering data solutions to clients
  • Organisations seeking better data engagement and insights
  • IT professionals deploying apps (e.g. to Azure)

📍 Event Details
Location: The Gamer Club, Glasgow
Venue Info: A purpose-built basement venue for tech meetups — includes console lounge, arcade, kitchen/bar, PCs, projector, and gigabit internet.
💡 BYOB welcome | Free tea & coffee available
Directions: https://www.thegamerclub.co.uk/gettinghere
Date: 2026-02-04
Time: 18:30
Timezone: Europe/London

🎥 Streaming
We aim to stream this talk live here:
👉 Watch on YouTube

📬 Contact
If you have questions, get in touch:
📧 organisers@python.scot

💬 Matrix Chat
Matrix is an open, secure chat platform for communities and collaboration.
Join our Python Glasgow matrix room to chat with organisers and attendees:
https://matrix.to/#/#python:glasgow.social

Wed 6 May · 17:30< 50
Data & AnalyticsMeetupFree

PyData Exeter #13 - Open Source Community Talks @ Innovation Hub

Exeter, 🇬🇧 United Kingdom

Join us for a relaxed evening of free pizza, drinks and great talks on Python, data and open source software at PyData Exeter!

AGENDA

  • 18:45 - Doors open
  • 19:15 - Talks start
  • 20:30 - Talks finish. Stick around for a drink and networking at the CUCKOO taproom across the road.

SPONSORS

  • Exeter Innovation Hub
  • Butterfly Data
  • Mekion
  • NumFOCUS

Talk #1
Hugh Evans, "Mapping the PyData Community with Python and Web Scraping"

In this talk I'll be walking through my project building maps of the PyData community using data scraped from Meetup. Featuring insights into the PyData community, geo-encoding, map making with Folium, and a call to support your local PyData group.

Talk #2
Anna Andersson, "Organising a PyData Meetup with Ontologies: From Schema to Reasoning"

Ontologies provide a powerful way to move from raw data to structured, machine-understandable knowledge but for many practitioners, they remain abstract and difficult to apply. In this talk, we bridge that gap with a practical, hands-on example. Using the familiar scenario of organising a PyData meetup, we will build a simple ontology to model speakers, talks, venues, and community interactions. From there, we explore how this semantic layer enables reasoning — uncovering implicit relationships and validating assumptions in ways that traditional data models cannot. The goal of this talk is to make ontologies concrete, approachable, and useful, demonstrating how they can support better data integration, clearer thinking, and more intelligent systems in real-world workflows.

Talk #3
Venkata Prudhvi Kante, "Local Shops are closing at record speed. I walked past the empty units everyday - until the data told a surprising story"

I live in Exeter. I started counting empty shopfronts on my walks and every week there were more. So, I did what any data scientist would do. I scraped every UK retail closure going back a decade and built a dataset that didn't exist. What the data analysis uncovered surprised even me and I'll show you the real story behind why local shops are disappearing, and what I'm building to fight back.

SPEAKER DETAILS

  • Hugh Evans is a developer advocate and community manager with a particular interest in data and AI. He works in the streaming domain at Aiven where he helps to take care of the Kafka and ClickHouse communities. Out of office hours, he organises AI Signals, a community which hosts talks on real world applications of AI. Hugh is a former apprentice and an advocate for vocational learning as a pathway into an IT career. Here's Hugh's web page.
  • Anna Andersson is a data science and AI lead with a background across academia, startups, and consultancy. She has worked extensively with ontology-driven platforms, knowledge graphs, and applied machine learning, and enjoys building end-to-end AI systems with multidisciplinary teams. She is particularly interested in turning complex data into practical, explainable solutions, and has worked on projects across critical and regulated industries.
  • Venkata Prudhvi Kante is a Data Scientist who, by day, builds data pipelines, dashboards and machine learning models at South West Water. By night, he is an independent researcher, investigating patterns hidden in public records that the published literature hasn't explored yet. Holding an MSc in Business Analytics from the University of Exeter and five years of hand-on experience across Python, Azure and Databricks. His latest independent research on UK retail structural collapse - built entirely from public data and open-source Python, is available on GitHub.

CODE OF CONDUCT
The PyData Code of Conduct governs this meetup. To discuss any issues or concerns relating to the code of conduct or behaviour of anyone at the PyData meetup, please contact the PyData Exeter organisers, or you can submit a report of any potential Code of Conduct violation directly to NumFOCUS.

Wed 6 May · 17:45< 50
Data & AnalyticsMeetupFreeOnline

Designing Effective and Accessible Reports in Power BI with Matt Mair-Durrant

Microsoft Data Platform Group Birmingham

Thursday 7th May 2026 at 13:00 GMT

Session Title: Designing Effective and Accessible Reports in Power BI

Speaker: Matt Mair-Durrant

Session Abstract:
In this session, we’ll explore how to create Power BI reports that not only look great but also communicate insights clearly and inclusively.
Attendees will learn practical design techniques, from choosing effective layouts and colour palettes to leveraging visual hierarchy and whitespace to guide user attention. We’ll look at real‑world examples that demonstrate how thoughtful design decisions can improve both comprehension and engagement.
Accessibility is an essential but often overlooked aspect of report design. We’ll cover how to ensure your dashboards can be used by people with diverse needs, including guidance on colour contrast, keyboard navigation, alternative text, and designing for screen readers. Participants will walk away with actionable tips for balancing aesthetics, accessibility, and analytical effectiveness, ensuring every report delivers impact while remaining usable for all audiences.

Additional notes

Agenda:

12.45 - 13:00 : Speaker Setup and Join

13:00 - 13:05 : Open Lobby to guests and Introductions

13:05 - 14:00 : Designing Effective and Accessible Reports in Power BI

Venue: Wherever you have access to a computer or smart device!
This session will be online only!

Other Details:

*** Please note registration on MeetUp is required to gain access to the Teams link!***

Please contact us via email if you are having any issues joining, and we'll do everything we can to help.

Event Organiser Contact Details:

If you need any further details or have any requests for this or future Data Platform Group events, please get in touch.

Email: meetup@purplefrogsystems.com

X: @MSDataGroupBrum
BlueSky: @msdatagroupbrum.bsky.social

Sponsors:

Purple Frog Systems

Power BI Sentinel

RP Analytics

Thu 7 May · 12:00< 50
Data & AnalyticsMeetupFree

Lisbon Databricks - May 2026

Lisbon, 🇵🇹 Portugal

We’re excited to be relaunching the Databricks Lisbon user group with a friendly, in-person evening designed to bring together the local data and AI community.

Join us for an in-person event focused on what you need to know in Databricks right now, with Databricks MVP, Simon Whiteley sharing the latest updates, followed by stories from Databricks practitioners in Portugal on how they are embracing AI on Databricks in the real world.

Across the evening, we’ll explore how AI capabilities in Databricks, including Genie and Agent Bricks, are helping practitioners work more effectively and scale with confidence.
More than anything, this event is about community. It’s a chance to meet others working with Databricks in Portugal, swap ideas, hear how people are approaching similar challenges, and build lasting professional relationships in a friendly and welcoming setting.

We’ll open from 17:00 local time for arrivals and networking, with talks starting at 17:45, and the evening will wrap up at 20:00.

Location: Avila Spaces, 52 Avenida Dom João II, Lisboa, 1990-096

Agenda

17:00 – 17:45 | Check‑in & Welcome

17:45 – 18:00 | Kick‑off: Unbottling Genie
Databricks team – introduction & agenda.

18:00 – 18:20 | Genie Spaces & Metric Views
Semantic models to supercharge agents
Speaker: Lucas Ihnen, Solution Architect, Databricks.

18:20 – 18:50 | Customer Story: EDP & Genie
EDP’s journey with Genie
Speakers: Rui Bastos, GenAI Engineer, EDP, Carlos Vieira, Senior AI Engineer & Tech Lead, EDP.

18:50 – 19:20 | Genie Code Deep Dive
Hyper‑productivity for Data & AI teams
Speaker: Simon Whiteley, CTO Advancing Analytics & Databricks MVP.

  • AI‑augmented code in real projects
  • How Genie Code fits into the stack
  • Building a Metric View & a Genie Code Skill

19:20 – 19:30 | Wrap‑up & Q&A
Advancing Analytics team - Key takeaways, open questions, what’s next for the user group.
19:30 – 20:30 | Networking, Food & Drinks
Meet the speakers, Databricks team and community.

___

Estamos muito entusiasmados por dar uma nova vida ao Grupo de Utilizadores da Databricks em Lisboa, com um encontro presencial e descontraído pensado para juntar a comunidade local de dados e IA.

Junte-se a nós neste evento presencial focado no que é mais importante saber sobre Databricks neste momento. O Databricks MVP Simon Whiteley irá partilhar as mais recentes novidades, seguido de histórias de profissionais da Databricks em Portugal que estão a aplicar IA na plataforma em contextos reais.

Ao longo da tarde, vamos explorar de que forma as capacidades de IA na Databricks, incluindo o Genie e o Agent Bricks, estão a ajudar os profissionais a trabalhar de forma mais eficaz e a escalar com confiança.

Acima de tudo, este evento é sobre comunidade. É uma oportunidade para conhecer outras pessoas que trabalham com Databricks em Portugal, trocar ideias, ouvir como outros estão a enfrentar desafios semelhantes e criar relações profissionais duradouras, num ambiente informal e acolhedor.

As portas abrem às 17:00 para chegadas e networking, as palestras começam às 18:00, e o evento termina pelas 20:00.

Local: Avila Spaces, Avenida Dom João II, 52, Lisboa, 1990-096
Nota importante: A maior parte do conteúdo será apresentado em inglês.

Thu 7 May · 16:00< 50
Data & AnalyticsMeetupFree

Microsoft Fabric & Power BI Meetup | Meetup #8 | Wien Edition

Vienna, 🇦🇹 Austria

# MICROSOFT FABRIC USER GROUP AUSTRIA
Achtes Meetup | 07. Mai 2026 | ab 18:00 Uhr (vor Ort)

📍 Cloudflight GmbH (Walcherstraße 1A/Stiege 3, 3. Stock, 1020 Wien)
🍴 Snacks & Drinks
🖥️ Nur Vorort (kein Online)
💬 Session: Englisch

Wir freuen uns, euch zum achten Meetup der Microsoft Fabric User Group Austria einzuladen. Unsere Community trifft sich monatlich und rotiert dabei zwischen Wien, Graz und Linz.

Diesmal sind wir bei Cloudflight in Wien rein Vorort zu Gast.

AGENDA

18:00 – Ankommen · Leute kennenlernen · Snacks & Drinks genießen
18:45 – Begrüßung · Update: What's New
19:00Weaving CI/CD into Fabric: Implementing CI/CD Best Practices with fabric-cicd | Stefan Mikic, Cloudflight (englisch)
20:00 – Offene Runde · Fragen, Austausch & gemütlicher Ausklang

ABSTRACT

Weaving CI/CD into Fabric: Implementing CI/CD Best Practices with fabric-cicd - Stefan Mikic

Every great fabric starts with the right threads but what happens when your Microsoft Fabric workflows are held together by manual Git actions, slow deployments, and the occasional human error? Things start to unravel.

In this session, learn to stitch together a solution using the Python Library fabric-ci-cd, weaving CI/CD best practices across DEV, TEST, and PROD workspaces, with version control, automated pipelines, and complete isolation of PROD.

Come see it in action, and leave with the patterns you need to weave CI/CD into your own Fabric! 🧵🪡

Thu 7 May · 16:00< 50
Data & AnalyticsMeetupFree

Building Data Literacy in the Microsoft Data Platform Era

Königswinter, 🇩🇪 Germany

As spring reaches its peak in May, we are delighted to invite you to the next meetup of the DataMonsters.io Microsoft Data Platform Rhineland Regional Group on May 7, 2026.

Building Data Literacy in the Microsoft Data Platform Era

In today’s data-driven organizations, the ability to interpret, communicate, and make informed decisions with data has become as fundamental as technical infrastructure itself.

Yet, while many enterprises invest heavily in platforms like Microsoft Fabric and Power BI, the full potential of these technologies is only realized when users across all levels possess a strong foundation in data literacy.
This session explores how to cultivate, measure, and sustain data literacy within an organization that builds its analytical ecosystem on the Microsoft Data Platform.
We dive into the basics of Data Literacy, check Best Practises and give some tips how to measure with data support from tools like FUAM.

And also this meetup is the perfect platform to expand knowledge and build valuable connections within the community.

Additional info

Definitely stay until the end because we tend to end the event with pizza 🍕.

Königswinter has very good connections to public transport. From the Königswinter ferry stop on line 66 it is only a 5-minute walk to the office of oh22information services GmbH.

The parking spaces directly at the company are limited, but the Drachenfelsbahn car park can be reached in about 5 minutes on foot.
Please register for the event as the number of places is limited. If you are unable to attend, we ask that you cancel your registration early to enable others to participate.

We look forward to welcoming you to the Building Data Literacy in the Microsoft Data Platform Era Meetup and sharing an engaging and inspiring evening together.

Thu 7 May · 16:00< 50
Data & AnalyticsMeetupFreeOnline

🔺 How to Transition into Data Engineering Using Real Azure & Databricks Project

🔺From SQL to Data Engineering: What It Actually Takes (Live Session)

Thinking about transitioning into Data Engineering… but not sure where to start?

Most people:

Learn tools
Watch tutorials
Build small projects

But still struggle to answer:

👉 “Can you design and build a real data pipeline?”

📍In This Live Session, I’ll Walk You Through:

What companies actually expect from Data Engineers

A real end-to-end pipeline (Bronze → Silver → Gold) built using Databricks & Azure

The gap between “learning” and “getting hired”

Common mistakes that slow people down

📍Who This Is For:

SQL Analysts looking to move into Data Engineering

Data professionals stuck at the “learning stage”

Anyone curious about Databricks, PySpark, and cloud pipelines

📍What You’ll Leave With:

A clear roadmap to transition into Data Engineering

Understanding of what real-world projects look like

Whether this path is right for you

👩🏽‍💻 About the Host

Hosted by Joy Onuoha, a Data Professional specialising in:

Databricks & PySpark
Azure Data Engineering
End-to-end pipeline development

https://www.linkedin.com/in/joy-onuoha-ebedo-221aa0172?

📩 Next Steps

At the end of the session, I’ll share how you can build this type of project step-by-step with guidance — if you decide it’s the right fit.

(No pressure — this session is designed to give you clarity first.)

Sun 10 May · 17:00< 50
Data & AnalyticsMeetupFreeOnline

Domain-Driven Design zum Anfassen – Live Coding statt Buzzwords

DDD gehört zu den meistdiskutierten Konzepten in der Softwareentwicklung – aber Hand aufs Herz:

Wie oft bleibt es bei Begriffen wie Entities, Aggregates oder Bounded Contexts, ohne dass wirklich klar wird, wie das Ganze im Code aussieht?
Genau das ändern wir in diesem Meetup.

Gemeinsam tauchen wir in die Welt des taktischen Domain-Driven Designs ein – nicht theoretisch, sondern ganz praktisch:
Wir schreiben Live-Code und zeigen Schritt für Schritt, wie aus fachlichen Anforderungen sauber strukturierter, verständlicher und wartbarer Code entsteht.

Gemeinsam tauchen wir in die Welt des taktischen Domain-Driven Designs ein – nicht theoretisch, sondern ganz praktisch:
Wir schreiben live Code und zeigen Schritt für Schritt, wie aus fachlichen Anforderungen sauber strukturierter, verständlicher und wartbarer Code entsteht.

Was dich erwartet:

  • Verständlicher Einstieg – auch wenn du DDD bisher nur vom Hörensagen kennst
  • Konkrete Beispiele aus der Praxis
  • Live Coding: Vom Problem zur sauberen Modellierung
  • Diskussion & Austausch auf Augenhöhe

Egal ob du DDD-Neuling bist oder bereits erste Erfahrungen gesammelt hast – hier bekommst du ein Gefühl dafür, wie sich gutes Design wirklich anfühlt.
Speaker: Richard Wallintin

Und wie immer gilt: Gute Gespräche, neue Kontakte und ein entspannter Abend inklusive.

Zeitplan
- 1750 Zoom wird geöffnet
- 1800 Start des Termins mit Ankommen
- 1805 Talk
- 1900 Offizielles Ende (wer noch Fragen hat gerne länger)

Wer Ideen zu Themen für weitere Meetups hat, kann uns diese unter WPS software@work meetup zukommen lassen - Feedback erwünscht !

Tue 12 May · 16:00 – 00:0050–200
Data & AnalyticsMeetupFree

Belgium dbt Meetup #14

Brussels, 🇧🇪 Belgium

dbt Meetups are networking events open to all folks working with data! Talks predominantly focus on community members' experience with dbt, however, you'll catch presentations on broader topics such as analytics engineering, data stacks, data ops, modeling, testing, and team structures.

🏠Venue: BeCentral, Cantersteen 12, 1000 Brussels (same building as the Brussels Central Station)
🤝Organizers: Sam Debruyn
🅿️ Interparking Albertina is at 5min walking distance
🚂 It is literally inside Belgium's best connected train station

To attend, please read the Health and Safety Policy and Terms of Participation: https://www.getdbt.com/legal/health-and-safety-policy

Our venue has capacity limits, so please only RSVP if you intend to come. Reach out - send a message in #local-belgium on Slack - if you need to cancel last minute or change your RSVP status on the Meetup to "Not Going."

📝Agenda:

  • 18h00: welcome with food & drinks
  • 18h45: start presentations
  • 20h00: networking & drinks

🗣️Presentations:

Scaling dbt (core) Infrastructure at Lighthouse
This session will be about creating standardized dbt components (internal dbt package, reusable CICD template, ...) to scale the use of dbt core across an entire organisation with many repositories.

by Marthe Van Den Hende, Data Platform Engineer at Lighthouse

Marthe is a Data Platform Engineer at Lighthouse with hands-on experience implementing dbt in SaaS companies, focusing on scalability. Before Lighthouse, she worked as a Data Engineer at Markmi.ai.

Accelerating Data Engineering with dbt MCP Server
In this session, I’ll introduce a more experimental approach to data engineering using a dbt MCP server. We’ll explore how this setup enhances developer productivity by enabling smarter workflows, automation, and tighter integration between dbt and external tools.

I’ll share early learnings from real-world usage, including how MCP can streamline model development, improve context awareness, and reduce friction in day-to-day analytics engineering tasks. This session is ideal for those curious about the next evolution of dbt workflows and how emerging tooling can augment traditional SQL-based development.

by Vladyslav Ishchenko, Analytics Architect at Element61

Vladyslav holds an MBA from KU Leuven specializing in Business Information Management and brings strong multicultural experience from living in Japan, Vietnam, and South Korea. He specializes in Microsoft Azure cloud technologies, data engineering, and analytics, with expertise in SQL, Python, Synapse, Databricks, and Power BI, and holds multiple Azure certifications. At element61, he designs large-scale data pipelines and delivers data-driven solutions that unlock business value for customers.

We are always looking for speakers!
To submit a session for one of the next meetups, please use our Sessionize page.
Are you in doubt if you're ready to give a talk? Check out dbt Labs's guide on how to deliver a fantastic presentation!

➡️ Join the dbt Slack community: https://www.getdbt.com/community/
🤝 For the best Meetup experience, make sure to join the #local-belgium channel in dbt Slack (https://slack.getdbt.com/)!

dbt is the standard in data transformation, used by over 40,000 organizations worldwide. Through the application of software engineering best practices like modularity, version control, testing, and documentation, dbt’s analytics engineering workflow helps teams work more efficiently to produce data the entire organization can trust.

Learn more: https://www.getdbt.com/

Tue 12 May · 16:0050–200
Data & AnalyticsMeetupFree

Harnessing Data Magic: Let a Genie Find Your Needle in a Field of Haystacks!

Join us for an engaging virtual meetup for the Queens NY Databricks User Group (QNY-DUG)! 🎉💻

Join the group: https://usergroups.databricks.com/queens-new-york-databricks-user-group/

Whether you’re based in Queens, elsewhere in NYC, or joining from anywhere around the world, this session is designed to bring our community together and set the foundation for what’s ahead.

🗓️ Virtual Event Agenda

1️⃣ Welcoming Introduction (10 min).
2️⃣ Getting setup on Databricks Free Edition and Databricks Community (10 min).
3️⃣ Announcements and how to submit a talk (10 min).
4️⃣ Databricks' built AI expert Genie knows your data better than you! (30 min) A talk about how:

  • Databricks can help you understand your datasets.
  • Surface data learnings and understandings.
  • Facilitate the classic "Finding a needle in a field full of haystacks".

5️⃣ Networking and guest introductions (30 min). Join our meeting to:

  • Meet fellow technologist, hobbyists and professionals.
  • Share your background and aspirations.
  • Open Question and Answer time.

This is more than a meetup — it’s the beginning of a strong, inclusive data community centered in Queens with a global reach. 🌎🚀

We’re actively looking for passionate members who want to present on data engineering, analytics, AI, or ML in future sessions. If you’re interested in speaking at an upcoming virtual event, let us know!📅

Join us online, connect from wherever you are, and help shape the future of QNY-DUG. See you there!

Sun 17 May · 14:30< 50
Data & AnalyticsMeetupFree

From Open Data to Data Mesh

Milan, 🇮🇹 Italy

Hello PyData People!
We are excited to announce our next event of 2026! This time, we will be hosted at Generali Italia’s Torre Generali in Milan for an evening dedicated to open data, data platform orchestration, and scalable data engineering.
📅 When: Thursday, May 19th, 2026 – 18:30–21:00
📍 Where: Torre Generali Italia, Milan
⚠️ Important: Spots are limited. Please keep your RSVP updated to allow others to participate if you can no longer attend.


🕒 Agenda
18:30 – Doors open & check-in
19:00 – Talk 1: Democratizing Data: A deep dive into Eurostat Open Database – Simona Mazzarino
19:45 – Talk 2: Scaling Data Mesh Orchestration with Dagster: Platform-Driven DAGs without Platform Friction – Marco Santoni and Andrea Romeo
20:30 – Networking & Social Dinner


🎤 The Talks
1️⃣ Democratizing Data: A deep dive into Eurostat Open Database
Speaker: Simona Mazzarino (Data Scientist @ Clearbox AI)
In this talk, we explore how to turn open data into actionable insights. You will learn how to navigate Eurostat, the EU’s vast statistical database, and use Python tools to fetch and process data for research, AI, and real-world analysis. We will look at how to discover relevant datasets, work with Eurostat’s interface and APIs, and integrate public statistics into your workflows for visualizations, dashboards, and models. A guiding example will be the analysis of migration flows across European countries, from data discovery to a usable dataset for analytics or machine learning.
About the Speaker:
Simona Mazzarino is a Data Scientist at Clearbox AI. With a background in linguistics, semiotics, and artificial intelligence, she specializes in language technologies. She is an active volunteer in the Python Torino community, where she helps connect people interested in Python applications, and she also volunteers for PyCon Italia.


2️⃣ Scaling Data Mesh Orchestration with Dagster: Platform-Driven DAGs without Platform Friction
Speakers: Andrea Romeo, Platform Engineer @ TeamSystem
Data Mesh promises scalability and autonomy, but orchestration often becomes the hidden bottleneck: as data products grow, cross-domain dependencies explode, DAGs become fragile, and upstream teams are forced to react to every new consumer.
In this talk, we present a platform-designed orchestration solution built on top of Dagster that enables company-wide orchestration without coupling data products together. Instead of requiring data engineers to manually define DAGs or modify upstream pipelines, each data product simply declares its dependencies on upstream output ports. From these declarations, the platform automatically builds and maintains the global DAG behind the scenes, ensuring that upstream data products remain unaffected by new consumers, orchestration scales as the number of data products grows, and platform standards are enforced without slowing down teams.
We’ll walk through:

  • the orchestration challenges we faced when adopting data mesh at scale
  • how Dagster was extended and shaped into a platform abstraction, not exposed directly to product teams
  • the Python-based design patterns used to dynamically generate and evolve DAGs
  • the resulting developer experience for data engineers building data products

This session is a practical, production-tested story of how Python and Dagster can be used to enable scalable orchestration in a real data mesh implementation — without turning the platform team into a bottleneck.
About the Speakers:
Andrea Romeo is a Platform Engineer at TeamSystem, where he focuses on building and evolving the technical foundation behind data infrastructure. He works closely with development and data teams to design scalable, resilient platforms that support data-driven products and services, with a strong focus on backend, cloud technologies, and developer experience.


See you there!
The PyData Milano Team

Tue 19 May · 16:30< 50
Data & AnalyticsMeetupFree

Big-Data-Architekturen und Datenprodukte in der Praxis

Cologne, 🇩🇪 Germany

Willkommen zur nächsten INNOQ Technology Night in unserem Büro in Köln! Wir freuen uns, euch zu einem Abend voller Vorträge und angeregter Diskussionen einzuladen.

Unsere Speaker Christoph Windheuser und Stefan Negele haben diese Themen für euch im Gepäck: Die richtige Big-Data-Architektur für euer Unternehmen sowie praktische Heuristiken für den richtigen Schnitt von Datenprodukten. Der erste Vortrag startet um 19:00 Uhr. Nutzt die Pausen und die Zeit nach den Präsentationen für Networking und Austausch. Wir freuen uns auf einen tollen Feierabend mit euch!

18:30 | Doors open
19:00 | „Big-Data-Architekturen: Welcher Ansatz ist der richtige für euch?“ mit Christoph Windheuser (Databricks)
20:00 | „Datenprodukte richtig schneiden: Heuristiken für die Praxis“ mit Stefan Negele (INNOQ)
21:00 | Netzwerken

Die Talks:

"Big-Data-Architekturen: Welcher Ansatz ist der richtige für euch?"
Christoph Windheuser, Databricks

Wer Big-Data-Anwendungsfälle wie Analytics, Machine Learning oder GenAI umsetzen will, braucht die passende Datenarchitektur. Diese hat sich im Laufe der Zeit erheblich weiterentwickelt: von den ersten Data Warehouses bis hin zu modernen, cloudbasierten Daten- und Compute-Plattformen.

Dieser Vortrag wirft einen Blick auf Geschichte, Gegenwart und Zukunft von Big-Data-Architekturen. Im Mittelpunkt stehen konkurrierende Architekturmuster wie Data Mesh, Data Lakehouse, Data Fabric und Data Vault: Was unterscheidet sie, wo liegen die typischen Stolpersteine bei der Einführung, und wohin entwickelt sich das Feld?

Über Christoph:
Christoph Windheuser leitet bei Databricks ein Team von Big-Data- und KI-Lösungsarchitekten für Zentraleuropa und hilft Unternehmen dabei, herausfordernde Big-Data- und KI-Probleme zu lösen. Vor seinem Einstieg bei Databricks studierte er Informatik in Bonn, Pittsburgh, Tokio und Paris und schloss seine akademische Laufbahn mit einer Promotion im Bereich Spracherkennung mit Machine Learning an der E.N.S.T. in Paris ab. Anschließend arbeitete er über 25 Jahre in der IT-Branche, unter anderem bei SAP, Capgemini und ThoughtWorks. Big Data und KI sind seine Leidenschaft, über die er regelmäßig schreibt und spricht.

Datenprodukte richtig schneiden: Heuristiken für die Praxis
Stefan Negele, INNOQ

Datenprodukte sind ein zentrales Konzept moderner Datenarchitekturen. Doch wie groß sollte ein Datenprodukt eigentlich sein? Sind sie zu klein, müssen Konsumenten mühsam Daten zusammenstückeln. Ist es zu groß, verliert das Produkt seinen klaren Zweck und seine Ownership. Dieser Talk gibt eine praxisnahe Einführung in Datenprodukte und die Architekturen, in denen sie eingesetzt werden. Anschließend werden konkrete Heuristiken für den richtigen Schnitt vorgestellt, die nach den drei Archetypen source-aligned, aggregate und consumer-aligned gegliedert sind. Der Fokus liegt auf anwendbaren Leitfragen, die Teams dabei helfen, bessere Entscheidungen beim Design ihrer Datenprodukte zu treffen.

Über Stefan:
Stefan Negele ist seit 2012 als Softwareentwickler und Architekt in verschiedenen Unternehmen tätig und seit 2023 als Berater bei INNOQ. Seine Schwerpunkte liegen in den Bereichen Datenarchitekturen, Data Governance und verteilte Systeme.

Begrenzte Plätze: Bitte updatet eure Teilnahme, wenn etwas dazwischenkommt. Das erleichtert uns die Planung!

Thu 21 May · 16:30< 50