Analytics Teams for the Win, with John Thompson

About this Episode

In today’s episode, we go deep into the career of veteran industry innovator and leader John Thompson, Global Head of Artificial Intelligence & Rapid Data Lab at global biotechnology leader CSL Behring. John and Jesse discuss the art of the analytics team, how a leader can sell their data-vision to their C-Suite, and the potentially bright future ahead for data governance and monetization. John also shares more about his book Building Analytics Teams: Harnessing Analytics And Artificial Intelligence For Business Improvement, as well as his upcoming release, The Future of Data: What Happens to Your Data.

More about our host, Jesse Anderson
More about our guest, John Thompson
Find book recommendations and more resources for data professionals at dreamteam.soda.io

Video

Episode Transcript

John

Yeah, we do everything that everybody does. You know, we work in nimble cycles. We work quickly. We're very communicative. We always integrate subject matter experts into the process. We're mathematics and data experts. We're not subject matter experts. We don't know supply chain. We don't know clinical data. We don't know those things.

So our subject matter expert teams and our data teams and analytic teams are integrated tightly together. So yeah, we communicate as often and as freely and as fully as we possibly can. And we tell people, yeah, we think this is a six month project and probably it could be done in four, it could be done in eight. So we don't give any hard and fast dates as an end point, but we do give people what we think is a reasonable range. Now, generally what happens is that when we say that, there's some unease or consternation, but generally these projects return incredible ROI. We just did a project, and I explained to the executive sponsor that it could be six months, it could be eight months, it might be nine. But in the end we believe that the return will be somewhere between $24 and 39 million. And they're like, oh yeah, well, that doesn't matter if it's short. Great. And if it's a couple extra months, I don't care.

Jesse

Yeah. With that kind of ROI, any business person should be looking at that. And I wanna point out to everybody that's listening, this is how you sell analytics internally. You don't say, hey, we're going to do some kind of ML. We're going to do this. We are going to generate 24 million. We're going to do this. That's what perks up these business people's ears. So I think one of the things that's important is for people to hear how you position this so that you can continue your work, so that you can do your work. Really important to understand that.

You also have another book coming out in October/November timeframe coming out from Manning. It’s called The Future of Data: What Happens to Your Data. Could you give us a glimpse of what's going to be in this book?

John

Yeah, absolutely, Jesse, and thanks for the opportunity to talk about it. I'm always, like you, always thinking about the next book. So when I was writing Teams, I was thinking about, what am I gonna write next? And it came very clear to me talking to my sister and my brother and everybody that I talked to at the gas station, in the grocery store. I'm a very gregarious person. I talk to everybody that I can come in contact with. And whenever I’d bring up data, it was pretty clear that not many people understood what happens with and happened to your data in the current environment that we live in.

Now, I've been focused on data for nearly 40 years. So I thought, well, I have a perspective on this. I have something to say. So the book is really for every person in the world, every person that's connected to the internet anyway, and every person that uses Amazon or browses Spotify, or heaven forbid, uses Facebook. You know, it really talks about - this is what happens to your data. And that's the first third of the book. The middle third of the book talks about all the different rules and regulations and laws that are coming to be about data ownership and data privacy and data monetization. And then the last third of the book is what you can do as an individual to get ready to own and proactively manage and monetize your data. There's very few people that understand that in three to five years, you will be able to set the price of every piece of data you've ever generated.

Jesse

So you've talked about those different parts of the book. What's your favorite part of that?

John

I'm intrigued by all of it. The first third of the book was really just explaining the history of data. Why do we live in the world that we do? Why do we have the data ecosystem that we have? The middle third is very exciting. What's going on right now? And then I think if I had to say, you know, what is my favorite part, it's the last part. It's enlightening people as to hey, this is what's gonna happen. And some people I've talked to who are in the know say, oh, you know, everybody just wants free email or everybody wants free search or whatever.

You already gave them some of the advice early on that everything that I do is denominated in dollars, pounds, yen, euro. I never talk about speeds and feeds and data. I do that with my team, of course. We’re data scientists. We talk about those things in our project meetings. But when I'm talking to business people, it's all return on investment. It's you're gonna give me this and we're gonna give you that. It's a give to get situation. So you already said that that's every conversation I've ever had with a VP and C-level executive. And then after that, it's about being nimble.

That always helps, too. What do you think this manifests as? Let's say, somebody's listening to this and they're saying, "They're on a technical team. I treat them very technical, and I expect outcomes." What advice do you give them to say, no, you really need to do this creatively?

John

So I give each - if I can, if I have that many projects - I give each data scientist their own personal project portfolio. They own it. It's autonomous, they're responsible for it, and they have the authority to do whatever they need to do to get it done. Now, we'll have meetings. As I talked about, we have lots of meetings, lots of discussions. And sometimes I don't hear about a project for two or three weeks. Now, the reason I don't hear about it is that either they've got other priorities or they've run into a snag - the model doesn't work, the data isn't working, doesn't fit, isn't being integrated. And generally, if I haven't heard about it in three weeks, I'll ask.

Yeah, well, everything that we've done in the last 15, 16, 20 years has been neural network based. All the big breakthroughs you hear from Yann LeCun and Jeffrey Hinton and others, it's all neural networks all day all long. And they've done really great things. I don't mean to diminish what they've done. They're giants in the field, and they have brought us far in our understanding of data and what neural networks can do in the innovations they brought forward. So, kudos to you, gentlemen, but we need explainable AI.

We need that to work against neural networks and all other techniques. We need the work in causal algebra to take us past that. And I think one of the things that we'll see beyond any of these individual algorithmic approaches is ensemble modeling. We've had some real success in doing ensemble modeling, and that's bringing together many different datasets and many different models and stringing them together in a logical progression. So I think those things are what we need to see and what we will see in the near future.

Jesse

Tell me, why do you think explainable AI is so important in this?

John

I think it's very important because neural networks are hugely valuable to us, and they're ubiquitous, they’re pretty much everywhere. But in the most regulated fields like healthcare and pharmacy, where I'm at now, and financial services, we can't really use our most powerful techniques. And neural networks by and far are our most powerful techniques right now. We can't use them because we can't explain to the regulators that it's fair and that it's logical and it's ethical. So if we wanna use our most powerful techniques in all the different industries where they're appropriate, we need explainable AI.

Jesse

And I was having this conversation with somebody in a conference that I keynoted at. And I think that we're going to have, as probably part of some new GDPR-like law, a law requiring explainable AI. We have similar sorts of things in the US for financial services. Why were you declined for this credit? You have to explain that AI, that model. So it's going to be required. And I think we're going to have to go down that path. You've mentioned Judea Pearl's causal analytics. You just touched on it. Why do you think that that is going to be such a game changer?

John

Yeah. And I think you have. I think your book is really great. And I think it does that. And as you and I talked, as we were writing the books, I think we were writing 'em at the same time, that was the concept. That was the guiding principle. I don't think you and I discussed that, but that was, what we both came away with was that I did write and you did write a timeless book, something that isn't gonna change with the fashion.

Data engineering's role in the Center of Excellence. And many of the people that we hire as interns, we bring in data engineering. And they do a lot of the automation work and the data integration, the pipeline building, and then we give them data science work as they're more and more comfortable. It works out really well that way, because as a data engineer, you need to understand the data that you're working with.