As a data scientist for IBM Consulting, I’ve been fortunate enough to work on several projects to fulfill the various needs of IBM clients. Over my time at IBM, I have seen technology applied to various use cases that I would have never originally considered possible, which is why I was thrilled to steward the implementation of artificial intelligence to address one of the most insidious societal issues we face today, racial injustice.

As the Black Lives Matter movement started to permeate throughout the country and world in 2020, I immediately wondered if my ability to solve problems for clients could be applied to major societal issues. It was with this idea that I decided to look for opportunities to join the fight for racial equality and found an internal IBM community working on projects that were to be released through the Call for Code for Racial Justice.

Adding my two cents to TakeTwo

There were numerous projects that were being incubated within IBM but I found myself drawn to one in particular that was looking both an implicit and explicit bias. That project was TakeTwo and was to become one of the seven projects that was released as an external open source project just over a year ago. The TakeTwo project uses natural language understanding to help detect and eliminate racial bias — both overt and subtle — in written content. Using TakeTwo to detect phrases and words that can be seen as racially biased can assist content creators in proactively mitigating potential bias as they write. It enables a content creator to check content that they have created before publishing it, currently through an online text editor. Think of this like a Grammarly for spotting potentially racist language. TakeTwo is designed to leverage directories of inclusive terms compiled by trusted sources like the Inclusive Naming Initiative.

Not only did TakeTwo allow me to apply my expertise to improve the project, but it also afforded me the opportunity to look inward at some of the implicit racial biases that I may have held but have been formerly unaware of. Working on TakeTwo was great way to work on a mission that matters for the world, while also providing a chance for self-reflection.

See the solution action: 

The Data Challenge

While working on TakeTwo it became abundantly clear that although the solution aims at detecting bias by fielding and evaluating massive amounts of data, it’s important to recognize that the data itself can hold implicit bias in itself. By leveraging Artificial Intelligence and open source technologies like Python, FastAPI, JavaScript, and CouchDB, the TakeTwo solution can continue to evaluate the data it ingests, and better detect when bias exists within it. For example, one word or phrase that may be acceptable to use in the United States may not be acceptable in Japan – so we need to be cognizant of this to the best of our ability and have our solution function accordingly. As someone who is passionate about data science, I know from firsthand that our model is only as good as the data we feed it. On that note, one thing I’ve learnt from working on this project is that we need better data sets that can help us train the machine learning (ML) models that underpin these systems. Kaggle datasets has been a great starting point for us, but if we want to expand the project to take on racism wherever it exists, we’ll nee more diverse data.

On a related note, the skills needed on projects like these go way beyond just data science. Particularly for this project, it was important to leverage linguistics experts who can help define some of the cultural nuances that exist in language that a system like TakeTwo either needs to codify or ignore. Only by working cross-discipline can we get to a workable solution.

Be part of the future

The value that ML and AI bring to enhancing solutions like TakeTwo is inspiring. From hiring employees, to getting approved for a loan at the bank, ML and AI is permeating into the way we interact with one another and can help ensure we remove as much racial bias as possible for business decision-making. As technologists, we have a distinct responsibility to produce models that are honest, unbiased, and perform at the highest level possible so that we can trust their output.

You can check out how we at IBM are building better AI here. TakeTwo continues to make strides in developing and strengthening its efficacy, and you can contribute to making this open source project better. Check out TakeTwo and all of the other Call for Code for Racial Justice Projects today!


This post is part of a series during Black History Month covering the relationship between artificial intelligence and social justice. 


More from Analyze: Build and scale AI

Five benefits of a data catalog

Imagine walking into the largest library you’ve ever seen. You have a specific book in mind, but you have no idea where to find it. Fortunately, the library has a computer at the front desk you can use to search its entire inventory by title, author, genre, and more. You enter the title of the book into the computer and the library’s digital inventory system tells you the exact section and aisle where the book is located. So, instead of…

The importance of governance: What we’re learning from AI advances in 2022

Over the last week, millions of people around the world have interacted with OpenAI’s ChatGPT, which represents a significant advance for generative artificial intelligence (AI) and the foundation models that underpin many of these use cases. It’s a fitting way to end what has been another big year for the industry. We’re at an exciting inflection point for AI. Adoption of AI among businesses is increasing, and research and AI development on foundation models is enabling use cases like generative…

How data, AI and automation can transform the enterprise

Today’s data leaders are expected to make organizations run more efficiently, improve business value, and foster innovation. Their role has expanded from providing business intelligence to management, to ensuring high-quality data is accessible and useful across the enterprise. In other words, they must ensure that data strategy aligns to business strategy. Only from this foundation can data leaders foster a data-driven culture, where the entire organization is empowered to take advantage of automation and AI technologies to improve ROI. These…

Do you need coding skills to build a chatbot?

Virtual assistants are valuable, transformative business tools that offer a compelling ROI while improving the customer experience. By 2030, conversational AI chatbots and virtual assistants will handle 30% of interactions that would have otherwise been handled by a human agent—up from 2% in 2022.[i] So why haven’t more companies adopted this solution?  Due to the maturity and complexity of the technology, it can take years to reap the full benefits of developing a conversational AI platform. One challenge to implementing…