Big Data Ignite 2020 Webinar Series

DATA-DRIVEN IN UNCERTAIN TIMES

SEP 15 – OCT 15

Tuesdays and Thursdays 12-1 EDT

Bonus Sessions: Oct 20 and 22

Tuesday, 10/20 - How to build a scalable data lake in 45 minutes

Presenters:

Balu Vyamajala, Principal Software Engineer, Initech Global
Mahesh Boosi, Lead Data Engineer, Initech Global

Abstract:

This talk will go over how to build a scalable data lake in the cloud. The presenters will highlight the ability to consume streaming data from AWS Kinesis, build a Datalake in S3 and run SQL queries from Athena.

Thursday, 10/22 - How to build a Web Application with custom tags and image recognition in a jiffy!
Presenters:
Balu Vyamajala: Principal Software Engineer, Initech Global
Santhosh Surapuram: Lead DevOps Engineer, Initech Global

Abstract:

This talk will go over how to build a web application to manage images and videos with custom tagging and the ability to query in the cloud with image recognition in a matter of minutes.

Past: 9/15 - Predictive Maintenance Pipeline Using R

—————————
To receive a link to watch the recording of this talk now, register at https://bigdataignite.org/register/
—————————

Presenter: Nagdev Amruthnath

LinkedIn Profile

Slide Deck

Abstract:

The manufacturing industry is currently going through its fourth industrial revolution with advancements in quality, computer vision, safety, and maintenance. Among these advancements, predictive maintenance is seen as the magical unicorn for achieving a competitive edge and significantly improving efficiency. In a study by PWC, it is estimated that by implementing predictive maintenance, manufacturing industries could reduce cost by 12 percent, improve uptime by 9 percent, reduce various risks by 14 percent and extend the life of the machine by 20 percent. In this presentation, we introduce predictive maintenance, build predictive models using R, and deploy them in a production environment.

 Co-Sponsored by:

West Michigan R Users Group

Past: 9/17 - Reproducibility: Lessons Learned Building a Platform for Educational Data Science Replications

—————————
To receive a link to watch the recording of this talk now, register at https://bigdataignite.org/register/
—————————

Presenter: Christopher Brooks

LinkedIn Profile

Slide Deck

Abstract:

This presentation will outline a program of reproducibility research in the domain of learning analytics with a specific focus on predictive models of student success.

Past: 9/22 - Optimization and Data Analytics Tools for Addressing COVID-19 Related Problems

Presenter: Siqian Shen, http://www-personal.umich.edu/~siqian/

Slide Deck

Abstract:

The outbreak of coronavirus disease 2019 (COVID-19) has created a global health crisis and the response to the COVID-19 pandemic is deeply influenced by local, national and global policies and decisions. In this talk, we present a few examples to demonstrate (i) how infection status dynamically affects mobility patterns and travel behavior, (ii) how to optimize business/state reopening and closedown strategies, (iii) how to distribute testing kits and locate testing facilities, and (iv) how to redesign public transit systems like city buses to reduce passengers¡¯ infection risk. In particular, we show the use of data analytics tools and optimization models for solving these problems, validated using real data of COVID-19 infection, business economy and local mobility. The talk is based on an ongoing collection of COVID-related system engineering problems and their solution methods at:  https://sites.google.com/umich.edu/decision-tools-for-covid19/

Sponsored By:
Jing

Past: 9/24 - Putting an Event Driven Architecture at the Heart of Your Organization

—————————
To receive a link to watch the recording of this talk now, register at https://bigdataignite.org/register/
—————————

Presenter: Mark Fei

LinkedIn Profile

Abstract:

In this session we’ll discuss the rapidly spreading approach organizations are embracing, moving to a real-time, event driven architecture for all their applications. We’ll answer questions like, “Why?”  “What are the advantages?” and we’ll examine some real life use cases that bring these answers into focus. Following that, we’ll delve into an overview of Apache Kafka, the de facto standard for achieving these transformations. During and after the presentation you will be able to ask questions of the speaker.

 Sponsored By:

Confluent

Past: 9/29 - The Academic to Industry Pipeline: Leveraging Academic Research for Industry Applications

—————————
To receive a link to watch the recording of this talk now, register at https://bigdataignite.org/register/
—————————

Presenter: David Corliss

LinkedIn Profile

Slide Deck

Abstract:

With so much Big Data technology starting out in university and government research centers, learning to leverage these resources and getting them to work in the real world presents special challenges. This presentation maps out a step-by-step process to move emerging smart technology from lab, classroom, and journal all the way through to industrial applications.

 

Past: 10/6 - The Peanut Butter Problem: Building an Out of Stock Product Recommender

—————————
To receive a link to watch the recording of this talk now, register at https://bigdataignite.org/register/
—————————

Presenters:
Terry Crist, LinkedIn Profile
Steve Krause, LinkedIn Profile
Tom Weinandy, LinkedIn Profile

Slide Deck

Abstract:

The peanut butter problem occurs when a customer orders a specific kind of peanut butter to be delivered in a grocery order, but the picker fulfilling the order sees that it’s not on the shelf. Which of the fifty other kinds of peanut butter should the picker select instead? This talk lays out a framework for substituting out of stock items from the perspective of a retail store or order fulfillment platform. We first define the problem from a business perspective and argue for how this solution helps a store’s bottom line. Then we propose a design for building a recommendation engine that uses available data to estimate a percent match between all products within a store. Finally, we propose an architecture for hosting the product data and recommender using a modern data warehouse. Our proof of concept shows that the recommendation engine successfully identifies meaningful product substitutions. The design is robust to work with limited data and flexible to allow for customization at both the store level and customer level.

Past: 10/1 - Actionable Ethics for Data Scientists

—————————
To receive a link to watch the recording of this talk now, register at https://bigdataignite.org/register/
—————————

Presenter:

Jay Qi

LinkedIn Profile

Follow on Twitter

Slide Deck

Abstract:

There is a growing awareness of ethical concerns in data science like algorithmic bias. The next step is to apply ethics to everyday practice. In this talk, you will learn about an actionable approach that enables you as a data scientist to start identifying and addressing ethical concerns in your work today. We will discuss the idea of ethics checklists, and show you how you can use an open-source tool, deon (deon.drivendata.org), to integrate an ethics checklist into your data science projects. Deon and the checklist framework enables you to actively consider the ethical implications of your work, and to preemptively address issues that may otherwise get overlooked. Rather than providing simple answers, our checklist framework spurs an ongoing dialogue that helps surface trade-offs, nuances, and unintended consequences. Using the default checklist in deon, we will walk through the relevant ethical concerns that arise at different points in the data science process—ranging from data collection to storage to modeling and deployment. We will use concrete real-world examples to illustrate the common ethical issues that arise. Ethics in data science requires more than just good intentions. Come learn how to jumpstart the conversation that all data science teams should be having and translate those good intentions into ethical actions.

 

Sponsored by: 
Mosaic Data Science | https://www.mosaicdatascience.com
Past: 10/8 - Data Science & Investigative Journalism: Combining Forces for Public Good

—————————
To receive a link to watch the recording of this talk now, register at https://bigdataignite.org/register/
—————————

Presented by:

Raymond Joseph, LinkedIn Profile
Adi Eyal, LinkedIn Profile

Slide Deck

Abstract:

The presenters will recount the development and application of data-analysis tools that were pivotal in investigative reporting that exposed widespread corruption in the South African lottery. The presenters will also comment on other applications of data analytics to global investigations, such as the UN-guaranteed right to access to clean water.

 

Co-presented by:

The Center for Collaborative Investigative Journalism (CCIJ) and the Padnos/Sarosik Civil Discourse Program at GVSU

Past: 10/15 - SAFE AI: Secure, Accountable, Fair, and Explainable

—————————
To receive a link to watch the recording of this talk now, register at https://bigdataignite.org/register/
—————————

Presented by: Sray Agarwal

LinkedIn Profile

Abstract:

This presentation will be an overview of ML/AI tools & techniques to achieve fair and ethical results across the data-collection and data-modeling lifecycle.

Sponsored By:

Initech Global

Past: 10/13 - How Big Data Ignite Responded to the Covid Crisis: Project Reignite and the SafeZones App

—————————
To receive a link to watch the recording of this talk now, register at https://bigdataignite.org/register/
—————————

Presenters: Elliott Church, Omar Malik, Tuhin Mitra, Randall Prium

Abstract:

“SafeZones” is a mobile/web app designed to facilitate safe return to normal commercial activity in the COVID-19 era. Safezones incorporates two functionalities: It provides consumers with a ready source of accurate information about COVID-19 safety/risk factors about local businesses and also provides local businesses with information regarding availability of personal protective equipment. SafeZones is a collaborative project of the Research Division of Big Data Ignite, The Right Place, Grand Rapids Chamber of Commerce, Kent County, Calvin University, Grand Valley State University, and other organizations. The overarching goal in creating SafeZones is to provide a data-driven tool designed to dispel unwarranted fear, restore consumer confidence, and motivate appropriate caution where needed. SafeZones uses data contributed by businesses, community organizations, and the general public to analyze health-and-safety conditions at local businesses and public venues. In this session, a panel of key contributors will discuss the goals, development, deployment, and data analytics of the SafeZones app.

Sponsored By:

Initech Global

Subscribe today to stay up to date on our events!

"I used the time to connect and learn from others who are down in the trenches, making these big data platforms work at a handful of large enterprises here in West Michigan. It was invaluable to talk to people that understand the various platform vendors out there and learn about the tool choices driving their IoT, data, and data lake infrastructure." Excerpt from "Trends out of 2017’s Big Data Ignite", by Mac Fowler - a digital strategist, product development specialist, and 2017 Big Data Ignite Participant

Mac Fowler

Director of Strategy , Collective Idea

Watch the

2019 Conference Recap!

Our Mission

The mission of Big Data Ignite is to provide education and training in areas of computing technology and practice that hold the greatest potential for innovation and benefit, and to promote the interchange of information and ideas among current and aspiring experts, practitioners, and decision-makers.

What We Do

Big Data Ignite provides a platform for local and global thought leaders to share insights, experience, and knowledge, to discuss current issues and emerging trends, and to create social bonds and business relationships that fuel innovation.

 

How You Benefit

Big Data Ignite provides you access to a broad-based community of experts, practitioners, and decision-makers who are actively engaged in advancing the practice of data analytics, data management, IoT, AI, and cloud computing. Big Data Ignite provides you a forum to share your knowledge and ideas and to stay up-to-date on the state of the art, current issues, and emerging trends.

2020 Sponsors

Gold Sponsors

Silver Sponsor

Bronze Sponsors