• What I Wish I Knew: Transfer Reflections on Entering Berkeley Data Science (feat. Avani Gireesha, Hannah Brown, Jake Pastoria)
    May 2 2025

    Access the full transcript for this episode

    “I never thought I would find that sense of community here, especially as a transfer, because I've heard so much about the stereotypes…I think a club really helped combat that” —Avani Gireesha

    In our final episode of Season 9, we hear from three graduating UC Berkeley seniors, all of whom transferred from California community colleges into the Data Science major: Avani Gireesha, Hannah Brown, and Jake Pastoria. They reflect on their transitions from community college to Berkeley, discussing the clubs, research, and experiences they’ve gained in their two years here. Listen in as they offer advice for incoming transfer students on how to prepare academically, find community, and get the most out of their Berkeley experience!

    “I'm still not really used to the exam rigor here and how difficult it is, but that's totally okay. I feel challenged here, and it really pushes me to get out of my comfort zone and be a better student” —Hannah Brown

    “I think these classes change the way that I view education as a whole…I'll never forget opening up my first Data 8 Jupyter notebook and submitting it. Education here is really cool, and I think you should take all these classes, especially when the professors are absolute legends in Berkeley and just computer science and data science in general” —Jake Pastoria



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceeducation.substack.com
    Show More Show Less
    28 mins
  • Interdisciplinary Roots and Inclusive Pathways in Data Science (feat. Mike Ludkovski & Alex Franks)
    Apr 18 2025

    Access the full transcript for this episode

    “There's a famous quote by a statistician, John Tukey, who's often associated with sort of introducing and promoting the concept of exploratory data analysis. And his quote is that the best thing about being a statistician is that you get to play in everyone's backyard, by which he means, as a data scientist, you get to dabble in all of these different areas…the longer you work in statistics, data science and adjacent fields, you really start to see that all these stories around data that come up in different disciplines, they're actually linked through the language of statistics and mathematics. So when I start a new domain, I will usually try to start by reasoning by analogy” —Prof. Alex Franks

    In this week’s episode, we talk with Professors Mike Ludkovski and Alex Franks from UC Santa Barbara about their diverse research backgrounds—ranging from stochastic modeling to sports analytics—and how they shaped their approach to data science education. Mike and Alex discuss the value of co-teaching, designing interdisciplinary curriculum, and helping students connect theory to real-world practice. They also touch on some major initiatives aimed at expanding access to data science education, including the Southern California Consortium and the Pacific Alliance for Low-Income Inclusion.

    “We found out… the awareness of data science is vastly different across campuses within just a few miles of each other… we are trying to help different places stand up data science courses, programs, and share best practices. We organize events like datathons for high school and community college students” —Prof. Mike Ludkovski



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceeducation.substack.com
    Show More Show Less
    29 mins
  • Open Science, Jupyter, and Data Science Education (feat. Fernando Pérez)
    Apr 4 2025

    Access the full transcript for this episode

    “Lo nuevo que va a entrar al curso esta vez es la pregunta de qué hacemos con las herramientas de inteligencia artificial en este contexto. ¿Cómo? ¿Cómo usar? Yo no voy a pretender que eso no existe. Yo creo que es absurdo hoy en día imaginarnos que los estudiantes no lo van a usar. Prohibirles usar esas herramientas yo creo que es, es, es fútil. Entonces la pregunta mía es bueno, cómo le creo a los estudiantes un ambiente en el cual sepa que su privacidad está siendo respetada, que tienen acceso a herramientas que pueden usar potencialmente en su propio computador.”

    In our second Spanish-speaking episode of the podcast, Eric Van Dusen and special guest host Edwin Vargas Navarro sit down with Fernando Pérez, who is the Faculty Director of the Berkeley Institute for Data Science at UC Berkeley (BIDS), a Professor of Statistics, and co-founder of Project Jupyter and IPython. Fernando reflects on his path from physics to computational science, as well as the role of open-source tools and interactive computing in the development of Juptyer Notebooks. We touch on the evolution of Jupyter and how it furthers interdisciplinary and reproducible collaboration, and discuss Fernando’s teaching philosophy through courses like STAT 159, a course that emphasized reproducibility and collaborative computing. He speaks on the challenges of AI integration in education, and offers broader advice to fellow data science educators on how to approach this quickly-evolving landscape.

    En nuestro segundo episodio en español del podcast, Eric Van Dusen y el invitado especial Edwin Vargas Navarro conversan con Fernando Pérez, quien es el Director de Facultad del Berkeley Institute for Data Science (BIDS) en UC Berkeley, profesor de Estadística y cofundador de Project Jupyter e IPython. Fernando reflexiona sobre su camino desde la física hasta la ciencia computacional, así como el papel de las herramientas de código abierto y la computación interactiva en el desarrollo de los Jupyter Notebooks. Tocamos la evolución de Jupyter y cómo promueve la colaboración interdisciplinaria y reproducible, y discutimos la filosofía de enseñanza de Fernando a través de cursos como STAT 159, un curso que enfatizaba la reproducibilidad y la computación colaborativa. Él habla sobre los desafíos de la integración de IA en la educación, y ofrece consejos más amplios a los educadores de ciencia de datos sobre cómo abordar este panorama en rápida evolución.

    “Porque si bien la matemática puede ser la misma, el valor de la ciencia de datos es que no es puramente probabilidad estadística o álgebra lineal. Es que esos datos vienen de algún lugar concreto, vienen de una comunidad, vienen de un grupo de personas, se reflejan, reflejan aspectos de ese contexto local y las decisiones que se van a tomar sobre esos datos van a afectar a una comunidad local.”



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceeducation.substack.com
    Show More Show Less
    33 mins
  • Teaching Data Science in a Changing World: Judith Canner on Reform, Collaboration, and Social Good
    Mar 21 2025

    Access the full transcript for this episode

    “I think a lot of times, we focus on data science as a tech thing, right? Oh, you're going to go work for Meta. You're going to go work for Google. You're going to go work for insert tech company here or AI startup here. And for a lot of students, especially a lot of my students, they really want to contribute to their communities and give back, right? They're thinking about how to make their community stronger. And when we only focus on the tech approach, that's very sort of up here, over there, you know, they know they'll make good money. And so they might pursue that, but they don't realize that data science can be used for a lot of good as well. You can use it in ways that actually serve the community, serve the world, from helping develop algorithms that can read MRIs or other medical imaging data, to help diagnose some sort of disease or cancer, or to identify human rights violations by being able to search massive amounts of documentation.”

    Today, we sit down with Judith Canner, a professor of statistics at California State University, Monterey Bay. Judith begins by reflecting on her role in redesigning first-year mathematics and statistics courses in response to some of the CSU’s executive orders, which took away traditional remedial mathematics classes. She explains to listeners how co-requisite courses and active learning strategies help students succeed, as well as touches on the importance of quantitative reasoning across a variety of disciplines. She talks about the effectiveness of pair programming within her teaching strategies, and implores people to reframe data science as a tool for social impact rather than just a way to a high-paying traditional tech job. Judith ends off by reminding fellow data science educators that data science is constantly evolving, so educators shouldn’t be afraid to embrace change and collaboration.

    “Don't be afraid to take a chance. The reality is that data science is still a little undefined and still constantly changing. And working in the Cal State system, I'm often confined by the system itself, right? We have to work within multiple systems when it comes to curriculum, but I'm seeing more and more educators really taking risks and more and more folks really thinking about, can we do this a completely different way than we've always done it? And so, not being afraid to take those risks. Can we teach math in a way completely different than we've always done it? Being OK with letting go of the status quo…”



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceeducation.substack.com
    Show More Show Less
    28 mins
  • Beyond Calculations: Ani Adhikari on the Art and Philosophy of Data Science Education
    Mar 7 2025

    Access the full transcript for this episode

    “It was in the 1970s that David Friedman and his colleagues completely changed the way statistics is taught in the world, from going from just an emphasis on calculation, calculation, calculation, without really paying any attention to, what's the question, and what can you do with the answer?… Why does anyone care? What is the calculation that you can justifiably do, given the information at hand? And then how do you interpret the answer? That is traditional statistics teaching, and I haven't strayed one step away from it. I'm still there. It's called data science now. The tools are different. And because the tools are different, we are empowered to ask questions that we wouldn't have dared to ask before. And we can answer it in ways that we couldn't before. But I still think I am teaching traditional statistics.”

    Today, we sit down with Ani Adhikari, a pioneer in building data science at UC Berkeley. She explains that traditional statistics education at Berkeley has always emphasized conceptual understanding, which she continues to aim to bring to the data science curriculum. Discussing teaching methods, she reassures statistics educators transitioning into data science that they don’t need to fundamentally change their approach—just the tools they use. Looking towards the future, Ani emphasizes AI’s rapid development, stressing the importance of equipping students with fundamental reasoning skills that will remain relevant regardless of how the industry continues to change. She ends by urging fellow educators to respect the history of data science, build on it, and remain aligned with their own intellectual and philosophical teaching goals.

    “Think about why: why are you wanting to be a data science educator? That answer will be very different for many people. But trying to get to the core of that answer is the key. What is your intellectual, philosophical reason? And then make sure that everything you do, you always ask yourself: am I achieving those philosophical intellectual goals that I had? And please, please, please respect the history. Do not think of data science education as something brand new. It has been happening since people started making decisions… Know the history, respect the history, and build on it. And then you will be fulfilled, and so will your students be.”



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceeducation.substack.com
    Show More Show Less
    18 mins
  • The AI Revolution in Data Science (feat. Paul Groth)
    Feb 21 2025

    Access the full transcript for this episode

    “I think one of the things we've approached in our data science curriculum is this idea that data science is a team sport…You're never really doing data science on your own. You're always in a team and you're working with product managers. You're working with end users. You're working with software engineers. You're working with salespeople. And that idea of how do I translate people's problems? What is my system going to do? What are the variables and what considerations I have when I'm designing a system with people? What are the algorithms going to do and what does that mean? So that kind of idea of treating it as a team sport and figuring that out as a student, is like a fundamental principle for how we do data science in these environments.”

    In this episode, we sit down with Paul Groth, Professor of Algorithmic Data Science at the University of Amsterdam. Throughout the episode, Paul shares the structure of data science and AI education in the Netherlands, highlighting how the Netherlands had AI undergraduate programs before data science became mainstream. He touches on the differences between AI education and data science, as well as his thoughts on treating scholarly publishing as structured data in his role as co-scientific director of the Discovery Lab. Finally, he ends with his approach to teaching and mentorship, approaching data science as a “team sport”.

    “In the Netherlands, we taught AI before we ever taught data science. So actually, we have I think one of the first few places in the world where we have a bachelor's in AI. So I think it’s different from the US system, where you might start off with a general curriculum and then specialize in something like computer science. Here, our undergraduate students come straight from high school and go directly into a subject. So we have, for example, they can go directly into AI and then they'll do a three-year bachelor's in AI.”



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceeducation.substack.com
    Show More Show Less
    17 mins
  • Empowering Community College Students Through Data Science (feat. Nathalie Guebels)
    Feb 7 2025

    Access the full transcript for this episode

    “A lot of times, you think as a young woman, well, I want to do something where I can care for the world, where I can make a difference in the world to make it better. Women often don't relate that to computer programming. And so that's where data science comes in and tells a story, where it combines not just the computational skills, but the statistical skills. But then there's also a requirement of having domain expertise. And that domain expertise, that can be anything. It can be healthcare. It could be climate change. It could be whatever field you want to enter. And so it's easier to relate that to caring for people or caring or having an impact in the world.”

    Today, we welcome Nathalie Guebels, Assistant Professor of Computer Science at Santa Barbara City College (SBCC), as she shares her transition from industry to academia and her larger commitment to making data science accessible to community college students. We touch on the development of SBCC's data science pathway, designed to create smooth transfer opportunities for students entering four-year universities. Nathalie also highlights her passion for supporting women in STEM, and details how she incorporates real-world datasets tailored to SBCC students. She talks about designing and co-teaching the cross-disciplinary course "Data Science for All," as well as reflects on the key role that collaboration has played in shaping SBCC’s growing data science program.

    “Bringing more real world examples and your own stories into the classroom gets the students more engaged and excited with the material because they connect it with their lives and their interests…I'll look at data from my classes, or we'll talk about SBCC farmers markets. But I think one of the main things we've changed…is bringing more focus on SBCC students and on the students themselves.”



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceeducation.substack.com
    Show More Show Less
    21 mins
  • Navigating the Data Revolution: Insights from ADSA Founder Micaela Parker
    Jan 24 2025

    Access the full transcript for this episode

    If you're worried that the AI hype is subsuming data science, remember: you can't have AI without data—data science isn’t going anywhere.

    Welcome to Season 9 of the show! To start off the new year, listen in on our conversation with Micaela Parker, founder and executive director of the Academic Data Science Alliance (ADSA). Micaela shares her journey into data science leadership, emphasizing the importance of inclusivity, interdisciplinarity, and community-building in the evolving fields of data science and AI. She reflects on the founding of ADSA and its mission to support faculty, staff, and students in designing, building, and sustaining data science programs. She discusses how ADSA fosters collaboration through annual meetings, working groups, and workshops that showcase innovative pedagogy and best practices, such as teaching responsible AI and integrating social justice into data science education.

    “We actively solicit keynote speakers, panelists, and participants for all of our events from a diversity of backgrounds and institutions, because we believe strongly that you can only aspire to a career that you see yourself doing, and that starts with seeing someone like you in that role.”



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit datascienceeducation.substack.com
    Show More Show Less
    20 mins