Memento provides next-generation technology and solutions that enable financial institutions to rethink and improve the way they combat fraud and manage compliance. Memento customers realize unmatched business value and rapid ROI.

bank fraud forum

Data Matters

April 28, 2009 by Mike Braatz
2 comment(s)

I spent last week in a series of meetings with fraud professionals from various large and mid-market banks. The discussions hit on a wide variety of timely and compelling topics – online fraud, compliance, case investigations, internal fraud, and more. Wanting to share some of the key takeaways from my trip, I spent some time reflecting on the meetings during my 5-hour flight home. After reviewing my notes and replaying the discussions, I kept coming back to one of the bedrock principles of enterprise software in general, and fraud prevention in particular: the importance of good data. Put another way, data matters.

In the world of fraud prevention solutions, there's a lot of talk about detection techniques – things like business rules, predictive analytics, collusive fraud networks, neural nets, real-time prevention and many more. But no matter the technique, the one thing on which they all rely is good data. In every case, it means bringing together financial transactions as well as reference sources like account maintenance data, web logs for online banking, internal system access records, etc. The idea is to provide as rich a data environment as possible, so fraud decisions are made in context of everything that is going on with a customer and their account across all products and channels. As a general principal, the more data, the more context, the better.

But making good data available to a detection system is much easier said than done, and most technology solutions fail (some miserably) in this regard. Why is this? Well, one big reason is that the databases underlying most fraud detection and case management applications require data to be formatted in a very specific way. This data formatting step is commonly known as extract, transform and load (ETL), and it is often long, complex and cumbersome. Solutions that rely on ETL to get data into their system either pass off this problem to the IT department of the bank or credit union (nice thanks guys), or they charge for it in the form of a long (read: very expensive) professional services gig. And, to make matters worse, the resulting data environment is inflexible – getting additional data fields into it in the future requires doing the whole ETL process all over again. Not good when fraud is constantly evolving and the need for additional data in the future is a near certainty.

I'm sure there are some innovative solutions and recommendations out there, and we'd like to hear about them. How have you solved this problem?


Make a Comment

* = Required
*
*
*
*
 

Recent Comments:

Marlon
May 12, 2009 - 9:16 AM
"You are absolutely correct Mike. Data does matter. There are solutions available that make getting to the data very flexible and adding new tests to identify risks and controls easy (read..in a matter of hours). There now needs to be a marriage of out of the box solutions with a flexible framework to allow you to adjust to the environment. And the good news is that there is. "
Jo Prichard
November 5, 2009 - 7:48 PM
"I'm scanning back in the blog and I've just come across this post. Data does matter indeed. Size matters too! Even if you have the Data, if you are unable to scale dynamically, you're no better off. It's a bit like being all dressed up, with no where to go. Streamlining the ETL process is essential towards speeding up the reaction time in responding to risk but it really forms a component of an institutions ability to respond to risk. The whole process is a team effort, from Marketing through to the investigators and anaylists. Of course, when you have a massive super computing platform at your disposal, it kind of changes the perspective and scale of the solutions and approaches one can entertain and contemplate. My question to you would be the following. If there was no limit on data size and every analytical algorythm you wanted to test ran in 2 milliseconds, how would that change your approach? In a perfect data and computing world, what would you like to see? Regards Jo "