Data Ethics

Columbus, OH -- March 7, 2021


Last semester, I took a class through the Big Ten Academic Alliance at Rutgers entitled Data: A Social History. This class was an exploration into a number of data-related historical topics, including the introduction of data and aggregation techniques, development of data applications over the past 200 years, and the history of data ethics and privacy.

For this post, I will be focusing on the third topic, as it has continued to be on my mind through the Winter months.

The applications of technology and data are nearly endless in the world of today. With the rise of the internet of things, and the continuing digitization of nearly everything in our lives, the opportunities for collection of data have exploded. Now, even the simplest things in our lives can be recorded in tables for future use. For instance, in my daily life, tasks such as turning my lights on (smart lightbulbs), communicating with parents (iMessage), getting a coffee (Google Maps, Starbucks app), or going on a run (Apple Watch) are recorded and likely analyzed.

This data has immense potential to lead massive change in our society- for the better or the worse.

In the realm of data for social good, we have seen extraordinarily valuable advances in healthcare throughout the pandemic. Data systems are at the backbone of the vaccine distribution process- a project which will save countless lives and eventually get life back to "normal". Additionally, systems such as Zoom and Teams have allowed many employees and students to continue working and learning remotely. We see apps like Waze which find more efficient routes for driving- reducing idling time in traffic. Finally, we have datafication of government, which is improving notoriously slow bureaucratic systems across the board. Many of these trends are still in their starting stages, but the impact of data is already visible.

But with good comes bad. As data is collected in so many areas of our lives, new concerns with security and privacy have (rightfully) emerged. One of the leading books speaking on these privacy concerns is The Age of Surveillance Capitalism by Shoshana Zuboff. I read this book back in January, and really appreciated the level of detail the author used to express her concerns. The immense amounts of data that are collected in many "free" services, such as Google Search, Instagram or Yahoo Finance is used to make money by big tech companies. Through analyzing user behavior, insights can be produced to advertise products, predict future behavior, and even develop a plan to change that behavior. Zuboff is very concerned with this consolidated power in the hands of a very small number of people, and the potential impact it could have on not only individuals but our democratic society as a whole.

This is where the ethics conversation comes in- something I wish I had been taught more in my four years at Ohio State. As a young data scientist, I need to be aware of the impacts of my work before, during and after a project. This requires asking questions, deciding what I believe is "right", and standing up for my beliefs. Too many algorithms have been designed with questionable intentions and with detrimental effects. For extensive reading on these, I recommend the books Weapons of Math Destruction by Cathy O'Neil and Algorithms of Oppression by Safiya Umoja Noble. I hope that students at other universities have had more extensive discussions in class about the impact of their work. In my curriculum, there was not a single class about ethics or the impact of big data on people’s lives. Without this education, how are we supposed to know what is right or wrong? For me, I have been lucky to be exposed in coursework outside of the typical curriculum, but I am not sure if others can say the same.

As I prepare to graduate and enter the workforce full-time, I am thinking about how my work will affect the world. I am starting at Procter & Gamble as a Product Supply Data Scientist. In this role, I intend to use Data Science, Logistics and OR techniques to improve efficiency of the company's transportation systems. Through this, I hope to not only save the company money, but to reduce the impact that transportation has on our climate.

There are a lot of huge conversations to go on the topic of Data Ethics. Governmental surveillance, corporate social responsibility and the rights to privacy are three of these. I intend to continue reading, talking with my colleagues and lending my voice in these conversations as they develop, guiding our society to a more ethical and connected status.