Posts Tagged 'OSINT'

Data Mining proves useful for Democratic Campaign

Howard Dean noted that by using private citizens’ credit card data, they could predict with 85% accuracy who you were likely to vote for.

Dean and Republican National Committee Chairman Mike Duncan were speakers at a luncheon at the National Press Club Nov. 5, where both reflected on what went right and wrong with their efforts during the 2008 election cycle.

Dean revealed his party used credit card data to predict voting outcomes. However, Democrats and the media have been highly critical of credit card companies that provide that data.

“We now can do what they can do,” Dean said. “We have your credit card data like they do. They’ve been for years doing something that we, until 2006, weren’t able to do. We can predict with 85 percent accuracy how you’re going to vote based on your credit card data without bothering to see what party you’re in – the Secretary of State’s office.”…

But while Congress has repeatedly slapped the hands of DHS for data mining for potential terrorists, the same methods are being used to garner votes.   Seems a bit odd doesn’t it?

There are concerns.  Danger Room’s “Barack Obama’s Privacy Challenge” sums up a number of them.

The Power of Virtual Teams

Excellent example on how the power of small motivated teams can serve as the antidote to the ills plaguing bloated beauracracies.

Hat tip to IntelFusion.

Matt Burton: “Set Your Analysts Free.”

Great post by Matthew Burton on the value of Open Source. He underscores my own inspiration for Project Grey Goose when he writes about the phenomenal but ultimately tragic Jim Gray rescue effort last year:

Freedom: The Jim Gray Rescue Effort

The Web is fast and free: fast, in the sense that you can make things happen overnight; and free, in the sense that you don’t need the boss’s permission to implement new ideas. This was no more apparent than in the case of Jim Gray.

Jim Gray was a database pioneer based in San Francisco. He disappeared while sailing in the Bay area last January. He was never found.

But his friends, many of whom were Silicon Valley magnates, did their best to try. One of these friends retasked DigitalGlobe imagery satellites to shoot fresh photos over 3,500 of sea. The result was 1400 gigantic images. So coders automatically split the images up into 560,000 smaller ones that could be easily reviewed; someone else contributed a computer program that automatically sharpened all of the images.

Now that they had over a half million pictures, they needed a way to solicit volunteers to look at them. Amazon.com already had a ready-made solution, so the images were posted on a special page that let virtual volunteers cull through these images in search of Gray’s sailboat. 12,000 volunteers reviewed all 560,000 images three times each.

All of that happened in three days. They didn’t find Gray’s sailboat. But it was the largest search party in history, on top of being simply amazing.

The Web’s speed and freedom made it possible. Had this job been given to an organization with a closed network, this never would have happened.

What about your organization’s intranet? Could your IT team split up those images into easy-to-consume portions? Could they write a script that automatically sharpened all 560,000 images? Could they create a tool that lets the rest of your workforce review the images from their own desks? Could your servers handle the load? And could all of that be accomplished in three days? In order to do that, you need to have the technical talent on hand.

Once you have the talent, those people need permission to work freely and create solutions on the fly. Do they have that permission? Or would your org have to submit justification statements, get contract managers to approve personnel reassignments, and run each line of code past the infosec people before finally getting something live?

On the Web, if you have an idea, you just do it. This is a philosophy and capability the Intelligence Community (and any large, data-driven organization) should keep in mind if it wants to fully exploit its data. Analysts will always have too much information to process and too little time in which to do it (and focusing the “open source intelligence” conversation on culling Web content contributes to that problem). To have any hope, they need an IT and bureaucratic infrastructure that will accommodate improvised solutions.

Again: if you want to make the most of open source, get an open network that lets analysts contribute their own material. Put all of your analysts onto one large network instead of letting them work on lots of small ones. Hire some developers and familiarize them with the analysts’ information problems. Then, set them free and let them experiment.