DataTalks.Club - the place to talk about data!
…
continue reading
1
Community Building and Teaching in AI & Tech - Erum Afzal
50:01
50:01
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
50:01
Links: LinkedIn: https://www.linkedin.com/in/erum-afzal-64827b24/ Twitter: https://twitter.com/Erum55449739 Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcampJoin DataTalks.Club: https://datatalks.club/slack.htmlOur events: https://datatalks.club/events.html
…
continue reading
1
Working in Open Source - Probabl.ai and sklearn - Vincent Warmerdam
52:02
52:02
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
52:02
Links: probabl. YouTube channel: https://www.youtube.com/@UCIat2Cdg661wF5DQDWTQAmg Calmcode website: https://calmcode.io/ probabl. website: https://probabl.ai/ Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcampJoin DataTalks.Club: https://datatalks.club/slack.htmlOur events: https://datatalks.club/events.html…
…
continue reading
1
AI for Ecology, Biodiversity, and Conservation - Tanya Berger-Wolf
51:47
51:47
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
51:47
Links: Biodiversity and Artificial Intelligence pdf: https://www.gpai.ai/projects/responsible-ai/environment/biodiversity-and-AI-opportunities-recommendations-for-action.pdf Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcampJoin DataTalks.Club: https://datatalks.club/slack.htmlOur events: https://datatalks.club…
…
continue reading
1
Knowledge Graphs and LLMs Across Academia and Industry - Anahita Pakiman
53:14
53:14
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
53:14
Links: GitHub repo: https://github.com/antahiap/ADPT-LRN-PHYS/tree/main Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcampJoin DataTalks.Club: https://datatalks.club/slack.htmlOur events: https://datatalks.club/events.html
…
continue reading
1
Inclusive Data Leadership Coaching - Tereza Iofciu
48:16
48:16
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
48:16
We talked about: Tereza’s background Switching from an Individual Contributor to Lead Python Pizza and the pizza management metaphor Learning to figure things out on your own and how to receive feedback Tereza as a leadership coach Podcasts Tereza’s coaching framework (selling yourself vs bragging) The importance of retrospectives The importance of…
…
continue reading
1
Building Production Search Systems - Daniel Svonava
58:25
58:25
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
58:25
Links: VectorHub: https://superlinked.com/vectorhub/?utm_source=community&utm_medium=podcast&utm_campaign=datatalks Daniel's LinkedIn: https://www.linkedin.com/in/svonava/ Free Data Engineering course: https://github.com/DataTalksClub/data-engineering-zoomcampJoin DataTalks.Club: https://datatalks.club/slack.htmlOur events: https://datatalks.club/e…
…
continue reading
1
Building Machine Learning Products - Reem Mahmoud
56:48
56:48
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
56:48
We talked about: Reem’s background Context-aware sensing and transfer learning Shifting focus from PhD to industry Reem’s experience with startups and dealing with prejudices towards PhDs AI interviewing solution How candidates react to getting interviewed by an AI avatar End-to-end overview of a machine learning project The pitfalls of using LLMs …
…
continue reading
1
Make an Impact Through Volunteering Open Source Work - Sara EL-ATEIF
55:56
55:56
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
55:56
We talked about: Sara’s background On being a Google PhD fellow Sara’s volunteer work Finding AI volunteer work Sara’s Fruit Punch challenge How to take part in AI challenges AI Wonder Girls Hackathons Things people often miss in AI projects and hackathons Getting creative Fostering your social media Tips on applying for volunteer projects Why it’s…
…
continue reading
1
Accelerating The Job Hunt for The Perfect Job in Tech - Sarah Mestiri
53:04
53:04
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
53:04
We talked about: Sarah’s background How Sarah became a coach and found her niche Sarah’s clients How Sarah helps her clients find the perfect job Finding a specialization Informational interviews Building a connection for mutual benefit The networking strategy Listing your projects in the CV The importance of doing research yourself and establishin…
…
continue reading
1
Machine Learning Engineering in Finance - Nemanja Radojkovic
53:10
53:10
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
53:10
We talked about: Nemanja’s background When Nemanja first work as a data person Typical problems that ML Ops folks solve in the financial sector What Nemanja currently does as an ML Engineer The obstacle of implementing new things in financial sector companies Going through the hurdles of DevOps Working with an on-premises cluster “ML Ops on a Shoes…
…
continue reading
1
Stock Market Analysis with Python and Machine Learning - Ivan Brigida
55:31
55:31
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
55:31
We talked about: Ivan’s background How Ivan became interested in investing Getting financial data to run simulations Open, High, Low, Close, Volume Risk management strategy Testing your trading strategies Sticking to your strategy Important metrics and remembering about trading fees Important features Deployment How DataTalks.Club courses helped Iv…
…
continue reading
1
Bayesian Modeling and Probabilistic Programming - Rob Zinkov
54:15
54:15
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
54:15
We talked about: Rob’s background Going from software engineering to Bayesian modeling Frequentist vs Bayesian modeling approach About integrals Probabilistic programming and samplers MCMC and Hakaru Language vs library Encoding dependencies and relationships into a model Stan, HMC (Hamiltonian Monte Carlo) , and NUTS Sources for learning about Bay…
…
continue reading
1
Navigating Challenges and Innovations in Search Technologies - Atita Arora
57:00
57:00
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
57:00
We talked about: Atita’s background How NLP relates to search Atita’s experience with Lucidworks and OpenSource Connections Atita’s experience with Qdrant and vector databases Utilizing vector search Major changes to search Atita has noticed throughout her career RAG (Retrieval-Augmented Generation) Building a chatbot out of transcripts with LLMs I…
…
continue reading
1
The Entrepreneurship Journey: From Freelancing to Starting a Company - Adrian Brudaru
56:21
56:21
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
56:21
We talked about: Adrian’s background The benefits of freelancing Having an agency vs freelancing What let Adrian switch over from freelancing The conception of DLT (Growth Full Stack) The investment required to start a company Growth through the provision of services Growth through teaching (product-market fit) Moving on to creating docs Adrian’s c…
…
continue reading
1
Become a Data Freelancer - Dimitri Visnadi
55:13
55:13
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
55:13
We talked about: Dimitri’s background The first steps of transitioning into freelance Working with recruiters (contracting) Deciding on what to charge for your services Establishing your network Self-marketing Contracting vs freelancing Which channel is better for those starting out? Cutting out the middleman Where to look for clients and how to ve…
…
continue reading
1
AI for Digital Health - Maria Bruckert
50:24
50:24
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
50:24
We talked about: Maria’s background Deciding to go into telecare (healthcare) Current difficulties in healthcare Getting into the healthcare industry as a lifestyle brand The importance of a plan B and being flexible What is SQIN and the importance of communication Going from lipstick to skin health analysis The importance of community and broadeni…
…
continue reading
1
Cracking the Code: Machine Learning Made Understandable - Christoph Molnar
51:59
51:59
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
51:59
We talked about: Christoph’s background Kaggle and other competitions How Christoph became interested in interpretable machine learning Interpretability vs Accuracy Christoph’s current competition engagement How Christoph chooses topics for books Why Christoph started the writing journey with a book Self-publishing vs via a publisher Christoph’s ot…
…
continue reading
1
The Unwritten Rules for Success in Machine Learning - Jack Blandin
50:26
50:26
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
50:26
We talked about: Jack’s background Transitioning from IC to management Lesson not taught in traditional school The importance of people’s perception, trust, and respect How soft skills are relevant to machine learning How to put on a salesman hat in machine learning management The importance of visuals and building a POC as fast as possible 1st Rul…
…
continue reading
1
From a Research Scientist at Amazon to a Machine learning/AI Consultant - Verena Webber
54:55
54:55
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
54:55
Links: Mini sound bath: https://www.youtube.com/watch?v=g-lDrcSqcrQ Free ML Engineering course: http://mlzoomcamp.comJoin DataTalks.Club: https://datatalks.club/slack.htmlOur events: https://datatalks.club/events.html
…
continue reading
1
From Marketing to Product Owner in Search - Lera Kaimashnіkova
55:14
55:14
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
55:14
We talked about: Lera’s background Lera’s move from Ukraine to Germany The transition from Marketing to Product Ownership The importance of communication and one-on-ones The role of Product Owner Utilizing Scrum as a Product Owner Building teams and cross-functionality Lera’s experience learning about search The importance of having both technical …
…
continue reading
1
Collaborative Data Science in Business - Ioannis Mesionis
55:50
55:50
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
55:50
Links: LinkedIn: https://www.linkedin.com/in/ioannis-mesionis/ Github: https://github.com/ioannismesionis Website: https://ioannismesionis.github.io/ Free ML Engineering course: http://mlzoomcamp.comJoin DataTalks.Club: https://datatalks.club/slack.htmlOur events: https://datatalks.club/events.html
…
continue reading
1
Bridging Data Science and Healthcare - Eleni Stamatelou
54:02
54:02
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
54:02
Free ML Engineering course: http://mlzoomcamp.comJoin DataTalks.Club: https://datatalks.club/slack.htmlOur events: https://datatalks.club/events.html
…
continue reading
1
DataTalks.Club Anniversary Interview - Alexey Grigorev, Johanna Bayer
57:44
57:44
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
57:44
Free ML Engineering course: http://mlzoomcamp.comJoin DataTalks.Club: https://datatalks.club/slack.htmlOur events: https://datatalks.club/events.html
…
continue reading
1
Data Engineering for Fraud Prevention - Angela Ramirez
54:14
54:14
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
54:14
We talked about: Angela's background Angela's role at Sam's Club The usefulness of knowing ML as a data engineer Angela's career path Transitioning from data analyst to data engineer/system designer Best practices for system design and data engineering Working with document databases Working with network-based databases Detecting fraud with a netwo…
…
continue reading
1
From Data Manager to Data Architect - Loïc Magnien
56:41
56:41
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
56:41
We talked about: Loïc's background Data management Loïc's transition to data engineer Challenges in the transition to data engineering What is a data architect? The output of a data architect's work Establishing metrics and dimensions The importance of communication Setting up best practices for the team Staying relevant and tech-watching Setting u…
…
continue reading
1
Pragmatic and Standardized MLOps - Maria Vechtomova
53:43
53:43
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
53:43
We talked about: Maria's background Marvelous MLOps Maria's definition of MLOps Alternate team setups without a central MLOps team Pragmatic vs non-pragmatic MLOps Must-have ML tools (categories) Maturity assessment What to start with in MLOps Standardized MLOps Convincing DevOps to implement Understanding what the tools are used for instead of kno…
…
continue reading
1
Democratizing Causality - Aleksander Molak
56:00
56:00
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
56:00
We talked about: Aleksander's background Aleksander as a Causal Ambassador Using causality to make decisions Counterfactuals and and Judea Pearl Meta-learners vs classical ML models Average treatment effect Reducing causal bias, the super efficient estimator, and model uplifting Metrics for evaluating a causal model vs a traditional ML model Is the…
…
continue reading
1
Mastering Data Engineering as a Remote Worker - José María Sánchez Salas
46:30
46:30
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
46:30
We talked about: José's background How José relocated to Norway and his schedule Tech companies in Norway and José role Challenges of working as a remote data engineer José's newsletter on how to make use of data The process of making data useful Where José gets inspiration for his newsletter Dealing with burnout When in Norway, do as the Norwegian…
…
continue reading
1
The Good, the Bad and the Ugly of GPT - Sandra Kublik
50:53
50:53
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
50:53
We talked about: Sandra's background Making a YouTube channel to break into the LLM space The business cases for LLMs LLMs as amplifiers The befits of keeping a human in the loop when using LLMs (AI limitations) Using LLMs as assistants Building an app that uses an LLM Prompt whisperers and how to improve your prompts Sandra's 7-day LLM experiment …
…
continue reading
1
LLMs for Everyone - Meryem Arik
55:28
55:28
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
55:28
We talked about: Meryam's background The constant evolution of startups How Meryam became interested in LLMs What is an LLM (generative vs non-generative models)? Why LLMs are important Open source models vs API models What TitanML does How fine-tuning a model helps in LLM use cases Fine-tuning generative models How generative models change the lan…
…
continue reading
1
Investing in Open-Source Data Tools - Bela Wiertz
54:57
54:57
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
54:57
We talked about: Bela's background Why startups even need investors Why open source is a viable go-to-market strategy Building a bottom-up community The investment thesis for the TKM Family Office and the blurriness of the funding round naming convention Angel investors vs VC Funds vs family offices Bela's investment criteria and GitHub stars as a …
…
continue reading
1
Why Machine Learning Design is Broken - Valerii Babushkin
51:20
51:20
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
51:20
Links: Book: https://www.manning.com/books/machine-learning-system-design?utm_source=AGMLBookcamp&utm_medium=affiliate&utm_campaign=book_babushkin_machine_4_25_23&utm_content=twitter Discount: poddatatalks21 (35% off) Evidently: https://www.evidentlyai.com/ Article: https://medium.com/people-ai-engineering/design-documents-for-ml-models-bbcd30402ff…
…
continue reading
1
Interpretable AI and ML - Polina Mosolova
52:47
52:47
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
52:47
We talked about: Polina's background How common it is for PhD students to build ML pipelines end-to-end Simultaneous PhD and industry experience Support from both the academic and industry sides How common the industrial PhD setup is and how to get into one Organizational trust theory How price relates to trust How trust relates to explainability T…
…
continue reading
1
From Scratch to Success: Building an MLOps Team and ML Platform - Simon Stiebellehner
53:33
53:33
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
53:33
We talked about: Simon's background What MLOps is and what it isn't Skills needed to build an ML platform that serves 100s of models Ranking the importance of skills The point where you should think about building an ML platform The importance of processes in ML platforms Weighing your options with SaaS platforms The exploratory setup, experiment t…
…
continue reading
1
From MLOps to DataOps - Santona Tuli
53:05
53:05
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
53:05
We talked about: Santona's background Focusing on data workflows Upsolver vs DBT ML pipelines vs Data pipelines MLOps vs DataOps Tools used for data pipelines and ML pipelines The “modern data stack” and today's data ecosystem Staging the data and the concept of a “lakehouse” Transforming the data after staging What happens after the modeling phase…
…
continue reading
1
Data Developer Relations - Hugo Bowne-Anderson
50:51
50:51
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
50:51
We talked about: Hugo's background Why do tools and the companies that run them have wildly different names Hugo's other projects beside Metaflow Transitioning from educator to DevRel What is DevRel? DevRel vs Marketing How DevRel coordinates with developers How DevRel coordinates with marketers What skills a DevRel needs The challenges that come w…
…
continue reading
1
Lessons Learned from Freelancing and Working in a Start-up - Antonis Stellas
50:30
50:30
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
50:30
We talked about; Antonis' background The pros and cons of working for a startup Useful skills for working at a startup and the Lean way to work How Antonis joined the DataTalks.Club community Suggestions for students joining the MLOps course Antonis contributing to Evidently AI How Antonis started freelancing Getting your first clients on Upwork Pr…
…
continue reading
1
Data Access Management - Bart Vandekerckhove
50:28
50:28
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
50:28
We talked about: Bart's background What is data governance? Data dictionaries and data lineage Data access management How to learn about data governance What skills are needed to do data governance effectively When an organization needs to start thinking about data governance Good data access management processes Data masking and the importance of …
…
continue reading
1
Data Strategy: Key Principles and Best Practices - Boyan Angelov
55:49
55:49
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
55:49
We talked about: Boyan's background What is data strategy? Due diligence and establishing a common goal Designing a data strategy Impact assessment, portfolio management, and DataOps Data products DataOps, Lean, and Agile Data Strategist vs Data Science Strategist The skills one needs to be a data strategist How does one become a data strategist? D…
…
continue reading
1
Practical Data Privacy - Katharine Jarmul
57:44
57:44
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
57:44
We talked about: Katharine's background Katharine's ML privacy startup GDPR, CCPA, and the “opt-in as the default” approach What is data privacy? Finding Katharine's book – Practical Data Privacy The various definitions of data privacy and “user profiles” Privacy engineering and privacy-enhancing technologies Why data privacy is important What is d…
…
continue reading
1
Building Scalable and Reliable Machine Learning Systems - Arseny Kravchenko
50:59
50:59
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
50:59
We talked about: Arseny's background Working on machine learning in startups What is Machine Learning System Design? Constraints and requirements Known unknowns vs unknown unknowns (Design stage) Writing a design document Technical problems vs product-oriented problems The solution part of the Design Document What motivated Arseny to write a book o…
…
continue reading
1
Building an Open-Source NLP Tool - Johannes Hötter
56:26
56:26
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
56:26
We talked about: Johannes’s background Johannes’s Open Source Spotlight demos – Refinery and Bricks The difficulties of working with natural language processing (NLP) Incorporating ChatGPT into a process as a heuristic What is Bricks? The process of starting a startup – Kern Making the decision to go with open source Pros and cons of launching as o…
…
continue reading
1
Navigating Industrial Data Challenges - Rosona Eldred
53:22
53:22
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
53:22
We talked about: Rosona’s background How mathematics knowledge helps in industry What is industrial data? Setting up an industrial process using blue paint Internet companies’ data vs industrial data Explaining industrial processes using packing peanuts Why productive industry needs data Measuring product qualities How data specialists use industri…
…
continue reading
1
Mastering Self-Learning in Machine Learning - Aaisha Muhammad
51:02
51:02
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
51:02
We talked about: Aaisha’s background How homeschooling affects self-study Deciding on what to learn about Establishing whether a resource is good How Aaisha focuses on learning Deciding on what kind of project to build Find research materials Aaisha’s experience with the Data Talks Club ML Zoomcamp ML Zoomcamp projects Aaisha’s interest in bioinfor…
…
continue reading
1
The Secret Sauce of Data Science Management - Shir Meir Lador
48:42
48:42
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
48:42
We talked about: Shir’s background Debrief culture The responsibilities of a group manager Defining the success of a DS manager The three pillars of data science management Managing up Managing down Managing across Managing data science teams vs business teams Scrum teams, brainstorming, and sprints The most important skills and strategies for DS a…
…
continue reading
1
SE4ML - Software Engineering for Machine Learning - Nadia Nahar
53:39
53:39
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
53:39
We talked about: Nadia’s background Academic research in software engineering Design patterns Software engineering for ML systems Problems that people in industry have with software engineering and ML Communication issues and setting requirements Artifact research in open source products Product vs model Nadia’s open source product dataset Failure …
…
continue reading
1
Starting a Consultancy in the Data Space - Aleksander Kruszelnicki
52:28
52:28
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
52:28
We talked about: Aleksander’s background The difficulty of selling data stack as a service How Aleksander got into consulting The Mom Test – extracting feedback from people User interviews Why Aleksander’s data stack as a service startup was not viable How Aleksander decided to switch to consulting Finding clients to consult Figuring out how to pos…
…
continue reading
1
Biohacking for Data Scientists and ML Engineers - Ruslan Shchuchkin
52:58
52:58
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
52:58
We talked about: Ruslan’s background Fighting procrastination and perfectionism What is biohacking? The role of dopamine and other hormones in daily life How meditation can help The influence light has on our bodies Behavioral biohacking Daylight lamps and using light to wake up Sleep cycles How nutrition affects productivity Measuring productivity…
…
continue reading
1
Analytics for a Better World - Parvathy Krishnan
54:34
54:34
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
54:34
We talked about: Parvathy’s background Brainstorming sessions with nonprofits to establish data maturity Example of an Analytics for a Better World project The overall data maturity situation of nonprofits vs private sector Solving the skill gap Publicly available content The Analytics for a Better World Academy The Academy’s target audience How re…
…
continue reading
1
Accelerating the Adoption of AI through Diversity - Dânia Meira
57:00
57:00
Main Kemudian
Main Kemudian
Senarai
Suka
Disukai
57:00
We talked about: Dania’s background Founding the AI Guild Datalift Summit Coming up with meetup topics Diversity in Berlin Other types of diversity besides gender The pitfalls of lacking diversity Creating an environment where people can safely share their experiences How the AI Guild helps organizations become more diverse How the AI guild finds w…
…
continue reading