
Negative Data?

Here's a truly special idea - via those crazy Thrashers, I ran into this intriguing article from The Economist, which points to the world of Negative Databases, and how they might be able to help the world of encryption and data security.

In a world where sensitive data frequently gets lost, data security folk are always trying to come up with the most secure way to store data. And it doesn't take a six-year-old to tell you that the best way to keep your data safe is to not have it there in the first place.
"Pshaw!", I hear you say, "You can't store it and not store it at the same time!", and in a way, you'd be right.

But then, in another, more accurate way, you'd be a bit wrong. Consider the following statement:
"All Ravens are Black"
From here, you could make all kinds of crazy assertions about all black things being ravens, but these are incorrect, despite being amusing. What's not incorrect is the flipped-around version (the contrapositive, if you want the fancy word):
"All Non-Black Things are Not Ravens"
Which, it turns out upon some reflection, is true.


So, the concept of a negative database is concerned with storing the absence of the things you'd like to store. If your customer database has a 20-character field for customer name, you'd then store in that table every single permutation of the alphabet of your choice, up to 20 characters, excepting the names of your clients. Let's call that table Non_Customers.

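Let's also say you used the standard 26-letter English alphabet. My amateur permutations math, n!/(n-r)! with n = 26 and r = 20, gives 560,127,029,342,507,827,200,000 possible combinations of letters that you can cram into that field. (Strictly, that formula only counts names that are exactly 20 letters long with no letter used twice; allow repeated letters and shorter names and the count climbs to roughly 2 x 10^28.)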

Let's be really generous, and say that you have 500,000 client records. So we end up with a table containing 560,127,029,342,507,826,700,000 records, all of which are precisely NOT your customers' names.
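The arithmetic above can be double-checked with a few lines of Python. This is just a sketch: the variable names are mine, and the second count (which allows repeated letters and names shorter than 20 characters) is an extra that the n!/(n-r)! formula doesn't cover.

```python
from math import factorial

ALPHABET_SIZE = 26      # standard English alphabet
FIELD_LENGTH = 20       # width of the customer name field
CUSTOMER_COUNT = 500_000

# The post's formula, n! / (n - r)!: ordered picks of exactly 20
# letters, with no letter used more than once.
no_repeats = factorial(ALPHABET_SIZE) // factorial(ALPHABET_SIZE - FIELD_LENGTH)

# Rows left in Non_Customers once the real customers are taken out.
non_customers = no_repeats - CUSTOMER_COUNT

# Assumption (not in the post's formula): letters may repeat and a
# name may be anywhere from 1 to 20 characters long.
with_repeats = sum(ALPHABET_SIZE ** k for k in range(1, FIELD_LENGTH + 1))

print(f"n!/(n-r)!            : {no_repeats:,}")
print(f"minus 500,000 clients: {non_customers:,}")
print(f"repeats, length 1-20 : {with_repeats:,}")
```

Either way you count it, the table is far too big to build, which is rather the point.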

All of your SELECT statements are now a bit harder to write, but with a little work, you could theoretically piece together the precise data that was missing from the table. And if someone were to find the database table lying around on a laptop, they wouldn't actually have the data. They'd have everything else!
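To make the "piece together what's missing" idea concrete, here is a toy version in Python. Everything in it is made up for illustration: a three-letter alphabet, names of at most three characters, and hypothetical helper names like build_non_customers. It is a naive sketch of the idea as described here, not how any real negative database product works.

```python
from itertools import product

ALPHABET = "abc"   # assumption: a tiny alphabet so the table stays small
MAX_LEN = 3        # assumption: names are 1 to 3 characters long


def universe(alphabet=ALPHABET, max_len=MAX_LEN):
    """Yield every possible name of length 1..max_len."""
    for length in range(1, max_len + 1):
        for letters in product(alphabet, repeat=length):
            yield "".join(letters)


def build_non_customers(customers):
    """Store every possible name EXCEPT the real customers."""
    real = set(customers)
    return {name for name in universe() if name not in real}


def is_customer(name, non_customers):
    """A valid name is a customer exactly when the table lacks it."""
    return name in set(universe()) and name not in non_customers


def recover_customers(non_customers):
    """Rebuild the customer list by finding the gaps in the table."""
    return sorted(name for name in universe() if name not in non_customers)


customers = ["ab", "cab", "bcc"]
non_customers = build_non_customers(customers)

print(len(non_customers))                 # 39 possible names minus 3 customers
print(is_customer("cab", non_customers))
print(recover_customers(non_customers))
```

One thing the toy makes obvious: anyone who knows the alphabet and the field length can run the same recovery, so in this naive form the security comes purely from the table being enormous.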

These numbers are stupidly big. When you consider that a very large database is classed as one with several billion rows, you can rest assured that the Non_Customers table isn't going to be working its way into your stored procedures anytime soon. But, as big as they are, they aren't infinite. Which means that as processing power increases, maybe one day it will be possible to store your entire backup as a secure database shadow...

Comments

  1. Yeah, you're right - it is unbelievably time consuming. But right there, you've hit it precisely on the head - remember that no encryption exists that can't be broken by someone, given enough time. Most security these days is based on the concept of making the maths so incredibly annoying that people won't bother.

    So, maybe if you had a supercomputer at your disposal, making a negative imprint of your db for transporting data might not be such a bad idea. However, if the technology was readily and easily available, it would kind of defeat the purpose - it would just be a really slow way to store information...

    I just liked the idea of storing non-data... what sort of a kooky idea is that?

