Kandungan disediakan oleh Brian Olsen and Trino Community. Semua kandungan podcast termasuk episod, grafik dan perihalan podcast dimuat naik dan disediakan terus oleh Brian Olsen and Trino Community atau rakan kongsi platform podcast mereka. Jika anda percaya seseorang menggunakan karya berhak cipta anda tanpa kebenaran anda, anda boleh mengikuti proses yang digariskan di sini https://ms.player.fm/legal.
Player FM - Aplikasi Podcast Pergi ke luar talian dengan aplikasi Player FM !
We're trying something different this week: a full post-show breakdown of every episode in the latest season of Black Mirror! Ari Romero is joined by Tudum's Black Mirror expert, Keisha Hatchett, to give you all the nuance, the insider commentary, and the details you might have missed in this incredible new season. Plus commentary from creator & showrunner Charlie Brooker! SPOILER ALERT: We're talking about the new season in detail and revealing key plot points. If you haven't watched yet, and you don't want to know what happens, turn back now! You can watch all seven seasons of Black Mirror now in your personalized virtual theater . Follow Netflix Podcasts and read more about Black Mirror on Tudum.com .…
Kandungan disediakan oleh Brian Olsen and Trino Community. Semua kandungan podcast termasuk episod, grafik dan perihalan podcast dimuat naik dan disediakan terus oleh Brian Olsen and Trino Community atau rakan kongsi platform podcast mereka. Jika anda percaya seseorang menggunakan karya berhak cipta anda tanpa kebenaran anda, anda boleh mengikuti proses yang digariskan di sini https://ms.player.fm/legal.
Kandungan disediakan oleh Brian Olsen and Trino Community. Semua kandungan podcast termasuk episod, grafik dan perihalan podcast dimuat naik dan disediakan terus oleh Brian Olsen and Trino Community atau rakan kongsi platform podcast mereka. Jika anda percaya seseorang menggunakan karya berhak cipta anda tanpa kebenaran anda, anda boleh mengikuti proses yang digariskan di sini https://ms.player.fm/legal.
Jan Waś teaches us about the new Faker connector and how you can use it to emulate data that does not exist on any storage, how you can shape it as you need, and how you can then learn real SQL, build real reports, and make some real charts - all with fake data. Details at https://trino.io/episodes/71…
Manfred Moser is joined by Peter Kosztolanyi to talk about the origins, current status, and future of the new Preview Web UI for Trino, before we play around with it in a demo. More info at https://trino.io/episodes/70
Manfred is joined by Wren AI team members and contributors to talk about the new AI-powered, text to SQL tool and its great support for Trino. More details at https://trino.io/episodes/66
Sebastian Bernauer and Sönke Liebau from Stackable join us to talk about their experience with using Open Policy Agent for access control with Trino. More details at https://trino.io/episodes/64
Emily Sunaryo, DevRel intern at Starburst, joins us to talk about her experience learning Trino and starting to write a web application with JavaScript to query data in Trino. More details at https://trino.io/episodes/63
Cole and Manfred talk with our guest Patrick Pichler from CreativeData about PowerBI and his open source Trino connector. More details in https://trino.io/episodes/61
We chat with Isa Inalcik from BestSecret about his proof of concepts for Trino functions calling AI/LLM systems. More details at https://trino.io/episodes/60
Interview with David Phillips and Matt Stepenson about OpenTelemetry integration in Trino and usage with Starburst Galaxy. More details at https://trino.io/episodes/57
Interview with Colleen Tartow and Roman Zeyde from VAST about the database and platform and how it works with Trino. More details at https://trino.io/episodes/56
We chat with Matthew Mullins and Roman Nesterov about Coginiti, a enterprise SQL workbench and collaboration tool with Trino support. More information at https://trino.io/episodes/53
When you're running a world-class, highly-performant query engine like Trino and investing time and resources into maintaining it, wouldn't it be great to treat your queries as a first-class, collaborative, versioned system, too? Enter PopSQL, a SQL client that makes it easier than ever to work together on Trino queries, collaborate with teammates, and visualize your results in a quick and easy way. We'll talk about how PopSQL integrates with Trino, what it can do for Trino users, and dive into the new Node adapter they built to be the most powerful way to use Trino in a web UI.…
50 episodes is a big number! With the episodes coming once a month, this has been years in the making, so we're inviting back Trino co-founder Dain Sundstrom and Trino/Iceberg Developer Advocate Brian Olsen to catch up, share some stories, and talk about what it takes to maintain an open source project for so long.…
We heard from Phillip Cloud at Voltron Data during Trino Fest, but in this episode, we take a deeper dive into what Ibis is, how you can use it with Trino, and why you'd want to do that.
We’ve put out nearly 50 Trino Community Broadcast episodes, but we haven’t yet done the simplest, most obvious topic of them all - an exploration of what Trino is, how Trino works, and how you can run it. This week, we’re taking a step back and doing a broader overview of those things, because the world needs to know… what is Trino?…
Just a few weeks ago, Trino added two new maintainers to help govern its open source codebase: James Petty from AWS, and Manfred Moser from Starburst Data. It's not every day that new maintainers are added to an open source project, so let's take some time to get to know them, their history with Trino, and how they ended up here!…
DolphinScheduler is a popular Apache data workflow orchestrator that enables running complex data pipelines. They recently added a Trino integration and will be demonstrating how to use DolphinScheduler to enable a series of transformations on the data lakehouse with Trino. - Intro Music: 0:00 - Intro: 0:31 - Trino release 407: 13:22 - What is workflow orchestration?: 21:12 - Why do we need a workflow orchestration tool for building a data lake?: 31:07 - What is Apache DolphinScheduler?: 37:35 - Does DolphinScheduler have any computing engine or storage layer?: 53:11 - What are the differences with other workflow orchestration, such as Apache Airflow?: 58:46 - Demo: Creating a simple Trino workflow in DolphinScheduler: 1:26:44 - PR: Improve performance of Parquet files: 1:47:04 Show Notes: https://trino.io/episodes/45 Show Page: https://trino.io/broadcast/…
In episode 44 we have guests from Starburst and Metabase. with us for the show. They collaborated on creating support for Metabase with Trino and made it available as open source project. We will dive into how this all came together, learn about some cool features of the combination of Metabase and Trino and just generally geek out. Show notes: https://trino.io/episodes/44 .html Show page: https://trino.io/broadcast/…
Let's face it, out of all those petabytes of data you've been hoarding, only a small fraction of it is creating business value for you today. When you scan the same data multiple times and transfer it over the wire, you're wasting time, compute cycles, and ultimately money. This gets worse when you're pulling data across regions or clouds from disaggregate Trino clusters. In situations like these, caching solutions like Alluxio can make a tremendous impact on the latency and cost of your queries. In this episode, we sit down with Alluxio PMC members, Bin Fan and Beinan Wang, to discuss Alluxio, its use cases, architecture, and how it fits into the Trino architecture. Show Notes: https://trino.io/episodes/43.html Show Page: https://trino.io/broadcast/…
We're going to discuss all of the awesome sessions that happened during Trino Summit this year. Manfred, Cole, and I will be joined by Martin, Dain, Brian Zhan, and Claudius for their, perspective and what they found most interesting about the summit. We also dive into stats around the summit and some exciting topics discussed off-camera. We'll also dive into some key takeaways from the Trino Contributor Congregation that took place the day after and some of the topics we went over there. - Intro Music: 0:00 - Intro: 0:32 - Trino Summit intro: 1:46 - Why the Pokémon theme?: 3:56 - Overview of Trino Summit and what stood out: 10:58 - Bringing Trino to the masses: 33:57 - Trino Contributor Congregation recap: 43:52 - Releases: 1:00:01 - Backlog grooming: 1:03:51 Show Notes: https://trino.io/episodes/42.html Show Page: https://trino.io/broadcast/…
Trino's initial use case was around replacing the Apache Hive runtime. As data lakes grew into prominence, it became clear that having a faster query engine didn't solve all problems. The Hive model itself was a huge bottleneck and didn't provide features that companies needed akin to data warehouses and databases. Apache Hudi is a new table format created out of Uber that aims to address many of these issues and usher in a new generation of data lake. Tune in as we speak to the Trino Hudi connector contributor Sagar Sumit and Grace Lu who uses Trino and Hudi at Robinhood to discuss the new Hudi connector and future plans! - Intro Music: 0:00 - Intro: 0:32 - Releases: 14:43 - Concept of the episode: Intro to Hudi and the Hudi connector: 22:29 - Concept of the episode: Merge on read and copy on write tables: 28:28 - Concept of the episode: Hudi metadata table: 39:24 - Concept of the episode: Hudi data layout: 46:39 - Concept of the episode: Robinhood Trino and Hudi use cases: 51:12 - Concept of the episode: Current state and roadmap for the Hudi connector: 1:03:15 - Pull request of the episode: PR 14445: Fault-tolerant execution for PostgreSQL and MySQL connectors: 1:08:14 - Demo of the episode: Using the Hudi Connector: 1:13:34 Show Notes: https://trino.io/episodes/41.html Show Page: https://trino.io/broadcast/…
Join us for this next episode of the broadcast, where we bring back Ryan Blue, the creator of Iceberg, to discuss some of the latest happenings in the Iceberg community. We also discuss and demo a bunch of new features that have come out in the Trino Iceberg connector. We also have a new guest, Tabular Developer Advocate Sam Redai, shedding light on this incredible community as well! Since the first episodes, Iceberg has finalized the v2 spec and added a lot of new features along the way. Likewise, we've improved Trino's writing capabilities around Iceberg. So much so that you can use Trino as the sole query engine atop Iceberg to support your data lake. We'll talk about all of this and more so don't miss it! - Intro Music: 0:00 - Intro: 0:32 - Releases: 6:27 - Concept of the episode: What is Iceberg?: 11:27 - Concept of the episode: Why Iceberg over other formats?: 16:50 - Concept of the episode: Metadata catalogs: 35:40 - Concept of the episode: Branching, tagging, and auditing, oh my!: 43:54 - Concept of the episode: The Puffin format: 50:53 - Concept of the episode: Trino Iceberg connector updates : 1:01:38 - Pull request of the episode: PR 13111: Scale table writers per task based on throughput: 1:11:37 - Demo of the episode: DML operations on Iceberg using Trino: 1:15:31 Show Notes: https://trino.io/episodes/40.html Show Page: https://trino.io/broadcast/…
In this episode we sit down with engineers, Steve Morgan and Edward Morgan, to discuss how they use Trino at Raft. Raft provides consulting services and is particularly skilled at DevSecOps. One particular challenge they face is dealing with fragmented government infrastructure. In this episode, we dive in to learn how Trino enables Raft to supply government sector clients with a data fabric solution. Raft takes a special stance on using and contributing to open source solutions that run well on the cloud. - Intro: 0:00 - Intro: 0:32 - Releases: 5:14 - Concept of the episode: Trino at Raft: 12:08 - Concept of the episode: Software factory: 13:21 - Concept of the episode: Standards and anatomy of a stack: 16:24 - Concept of the episode: Data Fabric at Raft: 18:58 - Concept of the episode: Security concerns around Trino: 22:01 - Concept of the episode: Iron bank container repository : 29:27 - Concept of the episode: Data Fabric user perspective : 36:13 - Concept of the episode: Challenges for adoption : 45:52 - Pull request of the episode: PR 13354: Add S3 Select pushdown for JSON files: 1:05:34 - Demo of the episode: Running Great Expectations on a Trino Data Lakehouse Tutorial: 1:08:01 - Question of the episode: How can I deploy Trino on Kubernetes without using Helm chart?: 1:13:55 Show Notes: https://trino.io/episodes/39.html Show Page: https://trino.io/broadcast/…
We'll be doing a more focused look at a specific feature that's being added to Trino: polymorphic table functions. We're excited to talk about what they do, where we are so far, where we're going, and how you can leverage them to make Trino better than ever! Show Notes: https://trino.io/episodes/38.html Show Page: https://trino.io/broadcast/ YouTube Video: https://www.youtube.com/watch?v=90e5WxhwNas…
This episode covers will introduce the benefits of having the Trino community around the Trino project. What is the purpose of communities in tech projects? Would the product be successful without a community or anyone to maintain it? We introduce some new faces that will be stewards in our journey to growing the adoption of our favorite query engine, what each of them does, and how their work impacts you as a community member! Most importantly, you can learn how to get involved and help us learn how to best navigate ideas, issues, or any other contribution you may have that helps make our favorite query engine the best-in-class! - Intro song: 00:00 - Intro: 00:32 - Releases: 9:37 - Concept of the episode: How to strengthen the Trino community: 15:07 - Concept of the episode: Pull request process: 30:33 - Concept of the episode: Impact of community and developer experience: 33:07 - Concept of the episode: Community metrics for better decision making: 44:00 - Pull requests of the episode: PR 12259: Support updating Iceberg table partitioning: 1:09:42 - Demo of the episode: Iceberg table partition migrations: 1:16:00 - Question of the episode: Can I force a pushdown join into a connected data source?: 1:28:40 Show Notes: https://trino.io/episodes/37.html Show Page: https://trino.io/broadcast/…
As Trino preps to jump to Java 17, we discuss the latest features added Java 11 to Java 17, talk with Martin through a few of the potential uses of new features like the Vector API, language improvements, and G1GC speedups, and finally, we will dive into discussing some of the features that we'll be implementing in the upcoming months under a new project in Trino! - Intro song: 00:00 - Intro: 00:36 - Releases: 8:17 - Question of the episode: Will Trino be making a vectorized C++ version of Trino workers?: 19:22 - Concept of the episode: Java 17 and rearchitecting Trino: 36:39 - Java 17 Updates: Performance: 40:10 - Java 17 Updates: Garbage collectors: 46:45 - Java 17 Updates: Java auto-vectorization: 1:06:22 - Java 17 Updates: Java Vector API: 1:12:08 - Java 17 Updates: Language features: 1:17:14 - Rearchitecting Trino: Update to Java 17: 1:27:19 - Rearchitecting Trino: Revamping Trino: 1:32:36 - Rearchitecting Trino: Project Hummingbird: 1:39:40 - Pull requests of the episode: PR 4649: Disable JIT byte code recompilation cutoffs in default jvm.config: 1:42:31 - Demo of the episode: FizzBuzz - SIMD style!: 1:49:18 Show Notes: https://trino.io/episodes/36.html Show Page: https://trino.io/broadcast/…
In our Trino Community Broadcast episode 35 we are catching up on recent releases 375, 376, 377, and 378. We then talk about how Trino is packaged as tarball, rpm, and docker container, what some of the differences are, and how you can customize either of them. Beyond we also look for your feedback and input on usage of the different packages. As a next step we chat about adopting Java 17 is standard for Trino, and then we get a demo of a new feature of the web UI. - Intro song: 00:00 - Intro: 00:32 - Releases: 4:22 - Concept of the episode: Packaging Trino: 21:28 - Additional topic of the episode: Modernizing Trino with Java 17: 46:49 - Pull requests of the episode: Worker stats in the Web UI: 55:25 - Question of the episode: HDFS supported by Delta Lake connector?: 1:01:52 - Demo of the episode: Tarball installation and new Web UI feature: 1:05:58 Show Notes: https://trino.io/episodes/35.html Show Page: https://trino.io/broadcast/…
News from the Trino releases 372, 373, and 374, and an update on Project Tardigrade are the start. Then we dive into the details of the new Delta Lake connector contributed to Trino by Starburst. - Intro song: 00:00 - Intro: 00:37 - Releases: 2:05 - Project Tardigrade update: 9:21 - Concept of the episode: A new connector for Delta Lake object storage. 18:37 - Pull requests of the episode: Add Delta Lake connector and documentation. 26:10 - Demo of the episode: Delta Lake connector in action. 29:14 - Question of the episode: How do I secure the connection from a Trino cluster to the data source? 54:00 Show Notes: https://trino.io/episodes/33.html Show Page: https://trino.io/broadcast/…
Goldman Sachs uses Trino to reduce last-mile ETL and provide a unified way of accessing data through federated joins. Making a variety of data sets from different sources available in one spot for our data science team was a tall order. Data must be quickly accessible to data consumers and systems like Trino must be reliable for users to trust this singular access point for their data. Join us on this next episode as we discuss with engineers from Goldman Sachs on how they integrated Trino and achieved scaling and high availability. - Intro Song: 00:00 - Intro: 00:28 - News: 8:39 - Concept of the month: High Availability with Trino: 20:23 - PR of the month: PR 8956 Add support for external db for schema management in mongodb connector: 1:04:09 - Bonus PR of the month: PR 8202 Metadata for alias in elasticsearch connector only uses the first mapping: 1:15:15 - Demo of the month: Trino Fiddle: A tool for easy online testing and sharing of Trino SQL problems and their solutions: 1:32:08 - Question of the month: Does trino hive connector supports CarbonData?: 1:38:09 Show Notes: https://trino.io/episodes/33.html Show Page: https://trino.io/broadcast/…
- Intro Song: 00:00 - Intro: 00:33 - News: 8:02 - Concept of the week: K8s architecture: Containers, Pods, and kubelets: 14:27 - PR of the week: PR 11 Merge contributor version of k8s charts with the community version: 55:20 - Demo: Running the Trino charts with kubectl: 57:42 Show Notes: https://trino.io/episodes/24.html Show Page: https://trino.io/broadcast/…
- Intro Song: 00:00 - Intro: 00:34 - News: 5:18 - Concept of the week: Row pattern matching and MATCH_RECOGNIZE: 14:26 - PR of the week: PR 8348 Document row pattern recognition in window: 52:16 - Demo: Showing MATCH_RECOGNIZE functionality by example: 57:13 - Question of the week: How do you tag a list of rows with custom periodic rules?: 1:12:51 Show Notes: https://trino.io/episodes/23.html Show Page: https://trino.io/broadcast/…
This episode will cover LinkedIn's journey to upgrade from PrestoSQL to Trino and some of the operational challenges LinkedIn's engineering team has faced at their scale. - Intro Song: 00:00 - Intro: 00:36 - News: 7:39 - Concept of the week: Trino usage at LinkedIn: 15:55 - Concept of the week: Trino hardware and operational scale: 23:23 - Concept of the week: Challenges operating at scale: 44:09 - Concept of the week: Open source at LinkedIn: 48:36 - Concept of the week: PrestoSQL to Trino upgrade challenges: 58:11 - Concept of the week: PrestoSQL to Trino upgrade steps: 1:13:32 - PR of the week: Digging into join queries: 1:33:18 - Demo: How to research the performance of a join: 1:38:53 - Question of the week: How can I query the Hive views from Trino?: 1:48:10 Show Notes: https://trino.io/episodes/22.html Show Page: https://trino.io/broadcast/…
- Intro Song: 00:00 - Intro: 00:35 - News: 7:42 - Question of the week: Can dbt connect to different databases in the same project?: 18:18 - Concept of the week: What is dbt?: 21:28 - Concept of the week: dbt + Trino: 38:09 - Demo: Querying Trino from a dbt project: 47:21 - PR of the week: PR 8283 Externalised destination table cache expiry duration for BigQuery Connector: 1:21:13 Show Notes: https://trino.io/episodes/21.html Show Page: https://trino.io/broadcast/…
- Intro Song: 00:00 - Intro: 00:35 - News: 10:16 - Concept of the week: Trino for the Trinewbie: 19:12 - Concept of the week: Marius' Journey: 21:03 - Concept of the week: Contributing to Trino: 54:55 - PR of the week: PR 8135 Set default time zone for the current session: 1:03:36 - Demo: Contributing to Trino: 1:11:49 - Question of the week: How do I search nested objects in Elasticsearch from Trino?: 1:24:24 We didn't have time to run through the demo. I created another video outside of the show if you want need help with the contribution process: https://www.youtube.com/watch?v=gAqYkR2oGgM Show Notes: https://trino.io/episodes/20.html Show Page: https://trino.io/broadcast/…
- Intro Song: 00:00 - Intro: 00:37 - News: 7:56 - Concept of the week: Ingesting into Iceberg with Pulsar and Flink at BlueCat: 17:30 - Concept of the week: BlueCat Overview: 20:31 - Concept of the week: Single Tenant to Multi-Tenant: 21:33 - Concept of the week: Pre-Iceberg: 26:13 - Concept of the week: Iceberg: 39:29 - PR of the week: PR 1905 Add format_number function: 1:01:55 - Demo: Showing the format_number functionality: 1:04:38 - Question of the week: How do I search nested objects in Elasticsearch from Trino?: 1:08:54 Show Notes: https://trino.io/episodes/19.html Show Page: https://trino.io/broadcast/…
Selamat datang ke Player FM
Player FM mengimbas laman-laman web bagi podcast berkualiti tinggi untuk anda nikmati sekarang. Ia merupakan aplikasi podcast terbaik dan berfungsi untuk Android, iPhone, dan web. Daftar untuk melaraskan langganan merentasi peranti.
Sertai aplikasi podcast terbaik di dunia untuk mengurus rancangan kegemaran anda di talian dan mainkan ia di luar talian dengan aplikasi Android dan iOS. Ia percuma dan mudah!