Automated image testing with Verified Pixel

By Nathan Willis
January 20, 2016

Verifying the authenticity of a digital image is no simple task, given the array of image-manipulation tools available and the difficulty inherent in tracing the provenance of any digital file. But attempting to establish the origin and veracity of a photo is not a lost cause. The non-profit organization Sourcefabric, which produces open-source journalism tools, has developed a web-based utility called Verified Pixel that attempts to make assessing an image's reliability an attainable goal.

The Verified Pixel site notes that news organizations are increasingly struggling with how to verify the authenticity of smartphone footage and other images captured by eyewitnesses' cameras in the field. A lot of news organizations solicit user-contributed images, which results in a glut of input and not enough time to perform forensic analysis of every photo. This is in spite of the fact that there are several well-known forensic tools available.

In April 2015, Sourcefabric and Eyewitness Media received a grant from the Knight Foundation to develop an image-verification web service to address the problem. Verified Pixel is the result of that effort. After teasing the prototype on Twitter in October 2015, Sourcefabric began rolling out access to a test server to beta testers in January 2016. I requested an invite, then spent some time kicking its tires and asking questions of the development team.

The source code is available on GitHub, although the dependencies are likely large enough to ward off casual users. Verified Pixel is implemented as a module for Sourcefabric's newsroom-management tool Superdesk (although the beta is currently a standalone web application, the intent is likely to make Verified Pixel a standard Superdesk component). The server side is written in Python, with an AngularJS client front-end. Like Superdesk, Verified Pixel is designed to be run on an organization's own server; the test server used for the prototype was maintained by Sourcefabric simply to solicit feedback on the program's functionality.

The idea is for Verified Pixel to provide a consistent, multi-user workflow: any user can upload images to the database, and a flexible battery of verification tests will be run to assess each image's reliability. Users can then flag individual images as suspect or potentially valid based on the scores of the validation tests. Authors or editors can subsequently select images to use taking those test results into account.

My pixel is my passport, verify me

The current battery of tests includes three online services: Google's reverse-image search (which will find suspiciously similar images if they exist), TinEye (which will attempt to locate probable duplicates of an image and determine which is the oldest), and Izitru (which runs a set of forensic tests to determine if an image has been edited). In addition, Verified Pixel automatically extracts any Exif data from the image and highlights key fields (for example, plotting GPS locations and camera direction, if available, on a map).

All users can add comments to images in the database, and the image database can be searched on all metadata fields, including the verification-test results. Images can also be grouped into collections several layers deep; the topmost layer is called a "desk," which is representative of a separate office within a publication's newsroom.

The Google and TinEye search results can help attest to whether or not an image has been published previously, which presumably would not be the case for a user-submitted photo of a current event. Such a test would still have value even for images that are not coming in as part of a breaking news story, such as detecting stock photos that were presented as original coverage.

The Izitru service runs six forensic tests and returns a "trust rating" on a one-to-five scale that corresponds to whether or not the image is in a "direct from camera" state or has been modified after the fact. Unfortunately, the service does not detail what the six tests are, although the Verified Pixel test-results page provides some rough descriptions. They include tests of how JPEG data is packaged within the file (based on differences between how cameras and software package JPEGs), tests to detect the traces of camera sensor patterns, and tests that look for artifacts of recompression. Founder Harry Farid has authored a lengthy list of research papers on image forensics that likely cover the same ground.

These tests also depend, at least in part, on analysis of known camera characteristics. The FAQ page notes that "each digital camera has distinct ways of applying JPEG settings when saving a file. Other differences result from artifacts that are introduced when images are saved multiple times." The site elsewhere says that it "relies on a database of 'signatures' that describe the distinct ways that different camera models store their JPEG files" and that cameras not in the database will likely not receive the highest trust rating.

So Izitru can rate the likelihood that an image is unmodified; if the image has been modified, though, further testing is required to determine what the modifications were. Simple resizing is not particularly bad, while adjusting color curves or editing for content are far more serious issues. Izitru is limited to analyzing JPEG files, which may be of concern to some users who would be interested in support for camera raw files. But the JPEG limitation is in line with what many news organizations ask from user-contributed content; in November 2015, the Reuters news service announced it would only accept image files from freelancers that were shot as JPEG.

Looking forward

For free-software developers, however, the service's proprietary and secret test battery is likely a larger issue. The good news is that Verified Pixel is designed to support pluggable test modules, and Sourcefabric's Douglas Arellanes said support for additional verification services is still to come. In an email, Arellanes said that the project spent a fair amount of time working with both OpenCV and ccv on modification-detection and other tasks using machine vision, but ultimately found that it was not a priority for users:

But for a lot of the basic things verifiers need to answer - like "has this image been published elsewhere on the Internet?" or "is this image altered?" it really made more sense to use APIs to services like Google reverse image search or to Izitru, because we were not going to be able to duplicate those any time soon.

Still, he said that he hopes to attract code contributions to support open-source libraries, since there is a lot that could be applicable. The team has an informal wishlist of additional tests it would like to see, such as automatic image tagging to recognize images containing traumatic content and a method to compare weather data (based on geolocation and timestamps) with the apparent conditions indicated in an image. The traumatic-content issue is one that Verified Pixel developer Sam Dubberley has explored in his own research.

But, for now, the project is focused on getting real-world feedback from news organizations. Arellanes said that "we need to see where the bottlenecks are for image verifiers - is it in workflow and getting the images into newsrooms' CMSes? Is it in making metadata easier? This is what we'd like to glean" from testing. He also added that Verified Pixel may be useful beyond the initial newsroom use case. "We think it has uses in a human rights context as well as in any situation requiring image verification - insurance claims, for example."

The beta runs its test battery on new uploads quickly, and searching and sorting are both painless. Given that the Google and TinEye assessments rely on a massive corpus of published-image data, it is hard to imagine comparable tests that do not dictate the use of a proprietary service. The Izitru modification-detection tests, though, could face stiff competition from other libraries if the project attracts a development community. It the meantime, it is clear to see how automating all such tests and collating the results can help users—in a newsroom or elsewhere—simplify the job of sifting through digital images of unknown trustworthiness.

Automated image testing with Verified Pixel

Posted Jan 21, 2016 7:50 UTC (Thu) by TRS-80 (guest, #1804) [Link]

One integration worth looking at would be the Granuiad's Grid image managament service, although its architecture isn't for the faint-hearted either.

Automated image testing with Verified Pixel

Posted Jan 21, 2016 15:36 UTC (Thu) by droundy (subscriber, #4559) [Link] (6 responses)

I was disappointed to see that this article had nothing to do with any sort of secure verification. It seems like it should be possible for camera makers to embed a private key into cameras that could be used to sign images. Obviously you would want this to be optional and easily scrubbable, but it seems like it could be extremely valuable in law courts as well as journalism.

Automated image testing with Verified Pixel

Posted Jan 21, 2016 21:40 UTC (Thu) by gwolf (subscriber, #14632) [Link] (5 responses)

You mean like with Nikon's Image Authentication System, <a href="http://nikonrumors.com/2011/04/28/nikon-image-authenticat...">cracked in 2011</a>, or Canon's Original Data Security, <a href="http://www.cnet.com/news/canon-camera-encryption-cracked/...">cracked in 2010</a>?

No, I know you probably mean signed by your own key. This would require other kind of adjustments at the legal level; an automatically-signed file does not (often) qualify for "electronic signature" (a device will do it regardless of the human pushing the button; it just authenticates the device, not the human). It would be a neat feature, but I doubt many people would understand it enough to use it.

Automated image testing with Verified Pixel

Posted Jan 21, 2016 21:42 UTC (Thu) by gwolf (subscriber, #14632) [Link]

(...sorry for not looking at the obvious, I sent this marked as plain text instead of HTML)

Automated image testing with Verified Pixel

Posted Jan 22, 2016 2:18 UTC (Fri) by droundy (subscriber, #4559) [Link] (3 responses)

I was thinking of an embedded private key (like a TPM), which would enable verification (versus, say, a manufacturer's database of public keys) that a photo (or video) had not been tampered with after being recorded. The goal wouldn't so much be for users to use it as to allow phone images and videos to be "proven" to be authentic. Naturally, it would rely on storing the private key in hardware that cannot be read, and would have privacy issues (a reason to store the signature in an easily-stripped exif tag or something, and have an option to disable it).

But it seems to me like cell phone videos and photos are now a sufficiently important aspect of how our society places blame and responsibility that we should be thinking about how to mitigate the danger of tampering.

Automated image testing with Verified Pixel

Posted Jan 23, 2016 0:50 UTC (Sat) by excors (subscriber, #95769) [Link] (1 responses)

On a phone, the image data is likely to go from the camera sensor through the ISP and into memory, then maybe through some postprocessing on the CPU and/or GPU, then to the JPEG encoder, and then you'd want to sign the JPEG data. You would have to keep that entire path secured, else it'd be fairly easy for a user to root their phone and inject their own doctored image data somewhere into the middle of the pipeline and get it signed with the secret key.

That doesn't sound too implausible to me - some mobile SoCs already have a secure path from video decoder to GPU to display (for DRM video playback), which prevents untrusted software (e.g. the Linux kernel and everything running under it) from accessing the protected memory, and they're better than the Nikon/Canon ones mentioned above at keeping their secret keys secret. But I doubt the ISP blocks are currently hooked into that system, so you'd need to find a sufficiently compelling reason for the SoC vendors to make those hardware changes.

Then you'd need to find a way to detect people who simply print out their doctored photo and take a legitimately-signed photo of that photo.

Automated image testing with Verified Pixel

Posted Jan 24, 2016 1:44 UTC (Sun) by giraffedata (guest, #1954) [Link]

I guess it would be substantially more feasible in a dedicated camera than in a general purpose computer. You can probably harden a dedicated camera pretty well against modification by the user. You could at least reduce the problem down to serious criminals, rather than anyone good with computers.

A serious camera even has distance detectors (and adjustable focus), which would make it pretty hard to fake a scene just by photographing a photograph. You might have to sculpt an elaborate replica of the fake scene.

Automated image testing with Verified Pixel

Posted Jan 23, 2016 1:35 UTC (Sat) by Cyberax (✭ supporter ✭, #52523) [Link]

I think Polaroid does this for specialized markets like law enforcement (you really want to keep the chain of custody clear).

Automated image testing with Verified Pixel

Posted Jan 22, 2016 11:04 UTC (Fri) by Karellen (subscriber, #67644) [Link] (1 responses)

Thankyou for the "Sneakers" reference. :-)

Automated image testing with Verified Pixel

Posted Jan 22, 2016 12:15 UTC (Fri) by spaetz (guest, #32870) [Link]

> Thankyou for the "Sneakers" reference. :-)
Seconded. Details like this are not lost on your audience! :-)

Automated image testing with Verified Pixel

Posted Jan 23, 2016 17:27 UTC (Sat) by yodermk (subscriber, #3803) [Link]

I suspect that if this were to become widely used, someone would quickly write a Photoshop or Gimp plugin that saves a (modified) JPEG in the "signature" style of a well-known camera ...

Intriguing stuff though.

Automated image testing with Verified Pixel

Posted Jan 27, 2016 9:23 UTC (Wed) by Rudd-O (guest, #61155) [Link] (2 responses)

The *Knight Foundation*? I thought they had already come up with, and shipped, this sort of software in their flagship Knight Industries Two Thousand, and that was back in the eighties!

Automated image testing with Verified Pixel

Posted Jan 27, 2016 19:02 UTC (Wed) by bronson (subscriber, #4806) [Link]

it was a great demo but never shipped in volume. Just unmaintainable code. They seem to have edited any buggy behavior out of the TV show.

Automated image testing with Verified Pixel

Posted Jan 29, 2016 16:32 UTC (Fri) by n8willis (subscriber, #43041) [Link]

You may be conflating Knight Industries with the Foundation for Law and Government (FLAG). Separate entities, albeit with some common board members. Not unlike Mozilla, at least in that respect.

Nate