Archive for the ‘Privacy’ Category

Can Big Data From Cellphones Help Prevent Conflict? [Privacy?]

Wednesday, April 10th, 2013

Can Big Data From Cellphones Help Prevent Conflict? by Emmanuel Letouzé.

From the post:

Data from social media and Ushahidi-style crowdsourcing platforms have emerged as possible ways to leverage cellphones to prevent conflict. But in the world of Big Data, the amount of information generated from these is too small to use in advanced data-mining techniques and “machine-learning” techniques (where algorithms adjust themselves based on the data they receive).

But there is another way cellphones could be leveraged in conflict settings: through the various types of data passively generated every time a device is used. “Phones can know,” said Professor Alex “Sandy” Pentland, head of the Human Dynamics Laboratory and a prominent computational social scientist at MIT, in a Wall Street Journal article. He says data trails left behind by cellphone and credit card users—“digital breadcrumbs”—reflect actual behavior and can tell objective life stories, as opposed to what is found in social media data, where intents or feelings are obscured because they are “edited according to the standards of the day.”

The findings and implications of this, documented in several studies and press articles, are nothing short of mind-blowing. Take a few examples. It has been shown that it was possible to infer whether two people were talking about politics using cellphone data, with no knowledge of the actual content of their conversation. Changes in movement and communication patterns revealed in cellphone data were also found to be good predictors of getting the flu days before it was actually diagnosed, according to MIT research featured in the Wall Street Journal. Cellphone data were also used to reproduce census data, study human dynamics in slums, and for community-wide financial coping strategies in the aftermath of an earthquake or crisis.

Very interesting post on the potential uses for cell phone data.

You can imagine what I think could be correlated with cellphone data using a topic map so I won’t bother to enumerate those possibilities.

I did want to comment on the concern about privacy or re-identification as Emmanuel calls it in his post from cellphone data.

Governments, who have declared they can execute any of us without notice or a hearing, are the guardians of that privacy.

That causes me to lack confidence in their guarantees.

Discussions of privacy should assume governments already have unfettered access to all data.

The useful questions become: How do we detect their misuse of such data? and How do we make them heartily sorry for that misuse?

For cell phone data, open access will give government officials more reason for pause than the ordinary citizen.

Less privacy for individuals but also less privacy for access, bribery, contract padding, influence peddling, and other normal functions of government.

In the U.S.A., we have given up our rights to public trial, probable cause, habeas corpus, protections against unreasonable search and seizure, to be free from touching by strangers, and several others.

What’s the loss of the right to privacy for cellphone data compared to catching government officials abusing their offices?

Our Internet Surveillance State [Intelligence Spam]

Tuesday, March 26th, 2013

Our Internet Surveillance State by Bruce Schneier.

Nothing like a good rant to get your blood pumping during a snap of cold weather! ;-)

Bruce writes:

Maintaining privacy on the Internet is nearly impossible. If you forget even once to enable your protections, or click on the wrong link, or type the wrong thing, and you’ve permanently attached your name to whatever anonymous service you’re using. Monsegur slipped up once, and the FBI got him. If the director of the CIA can’t maintain his privacy on the Internet, we’ve got no hope.

In today’s world, governments and corporations are working together to keep things that way. Governments are happy to use the data corporations collect — occasionally demanding that they collect more and save it longer — to spy on us. And corporations are happy to buy data from governments. Together the powerful spy on the powerless, and they’re not going to give up their positions of power, despite what the people want.

And welcome to a world where all of this, and everything else that you do or is done on a computer, is saved, correlated, studied, passed around from company to company without your knowledge or consent; and where the government accesses it at will without a warrant.

Welcome to an Internet without privacy, and we’ve ended up here with hardly a fight.

I don’t disagree with anything Bruce writes but I do not counsel despair.

Nor would I suggest any stop using the “Internet, email, cell phones, web browser, social networking sites, search engines,” in order to avoid spying.

But remember that one of the reasons U.S. intelligence services have fallen on hard times is the increased reliance on “easy” data to collect.

Clipping articles from newspaper or now copy-n-paste from emails and online zines, isn’t the same as having culturally aware human resources on the ground.

“Easy” data collection is far cheaper, but also less effective.

My suggestion is that everyone go “bare” and load up all listeners with as much junk as humanly possible.

Intelligence “spam” as it were.

Routinely threaten to murder fictitious characters in books or conspire to kidnap them. Terror plots, threats against Alderaan, for example.

Apparently even absurd threats, ‘One Definition of “Threat”,’ cannot be ignored.

A proliferation of fictional threats will leave them too little time to spy people going about their lawful activities.

BTW, not legal advice but I have heard that directly communicating any threat to any law enforcement agency is a crime. And not a good idea in any event.

Nor should you threaten any person or place or institution that isn’t entirely and provably fictional.

When someone who thinks mining social networks sites is a blow against terrorism overhears DC comic characters being threatened, that should be enough.

[T]he [God]father of Google Glass?

Tuesday, October 23rd, 2012

The original title is 3 Big Data Insights from the Grandfather of Google Glass. The post describes MIT Media Lab Professor Alex ‘Sandy’ Pentland as the “Grandfather of Google Glass.”

Let’s review Pentland’s three points to see if my title is more appropriate:

1) Big Data is about people.

SP: Big Data is principally about people, it’s not about RFID tags and things like that. So that immediately raises questions about privacy and data ownership.

I mean, this looks like a nightmare scenario unless there’s something that means that people are more in charge of their data and it’s not something that can be used to spy on them. Fortunately as a consequence of this discussion group at the World Economic Forum, we now have the Consumer Privacy Bill of Rights which says you control data about you. It’s not the phone company, it’s not the ad company. And interestingly what that does is it means that the data is more available because it’s more legitimate. People feel safer about using it.

I feel so much better knowing about the “Consumer Privacy Bill of Rights.” Don’t you?

With secret courts, imprisonment without formal charges, government sanctioned murder, torture, in the United States or at its behest, my data won’t be used against me.

You might want to read Leon Panetta Plays Chicken Little before you decide that the current administration, with its Consumer Privacy Bill of Rights has much concern for your privacy.

2) Cell phones are one of the biggest sources of Big Data. Smart phones are becoming universal remote controls.
….
Not so much in this country but in other parts of the world, your phone is the way you interface through the entire world. And so it’s also a window into what your choices are and what you do.

Having a single interface makes gathering intelligence a lot easier than hiring spies and collaborators.

Surveillance is cheaper in bulk quantities.

3) Big Data will be about moving past averages to understanding patterns at the individual level. Doing so will allow us to build a Periodic Table of human behavior.

SP: We’re moving past this sort of Enlightenment way of thinking in terms of markets and competition and big averages and asking, how can we make the information environment at the human level, at the individual level, work for everybody?

I see no signs of a lack of thinking in terms of markets and competition. Are Apple and Google competing? Are Microsoft and IBM competing? Are the various information gateways competing?

It is certainly that case that any of the aforementioned and others, would like to have everyone as a consumer.

Equality as a consumer for information service providers isn’t that interesting to me.

You?

The universal surveillance that Pentland foresees does offer opportunities for topic maps.

The testing of electronic identities tied to the universal interface, a cell phone.

For a fee, an electronic identity provider will build an electronic identity record tied to a cell phone with residential address, credit history, routine shopping entries, etc.

Topic maps can test how closely an identity matches other identities along a number of dimensions. (For seekers or hiders.)

The quoted post by: Conor Myhrvold and David Feinleib.

I first saw this at KDNuggets.

The Man Behind the Curtain

Tuesday, September 25th, 2012

The Man Behind the Curtain

From the post:

Without any lead-in whatsoever, we just ask that you watch the video above.

And we ask that you hang on for a few moments—this goes far beyond the hocus pocus you’re thinking the clip contains.

You really need to see this video.

Then answer:

Should watchers to watch themselves?

Should people watch the watchers?

Lifebrowser: Data mining gets (really) personal at Microsoft

Saturday, March 17th, 2012

Lifebrowser: Data mining gets (really) personal at Microsoft

Nancy Owano writes:

Microsoft Research is doing research on software that could bring you your own personal data mining center with a touch of Proust for returns. In a recent video, Microsoft scientist Eric Horvitz demonstrated the Lifebrowser, which is prototype software that helps put your digital life in meaningful shape. The software uses machine learning to help a user place life events, which may span months or years, to be expanded or contracted selectively, in better context.

Navigating the large stores of personal information on a user’s computer, the program goes through the piles of personal data, including photos, emails and calendar dates. A search feature can pull up landmark events on a certain topic. Filtering the data, the software calls up memory landmarks and provides a timeline interface. Lifebrowser’s timeline shows items that the user can associate with “landmark” events with the use of artificial intelligence algorithms.

A calendar crawler, working with Microsoft Outlook extracts various properties from calendar events, such as location, organizer, and relationships between participants. The system then applies Bayesian machine learning and reasoning to derive atypical features from events that make them memorable. Images help human memory, and an image crawler analyzes a photo library. By associating an email with a relevant calendar date with a relevant document and photos, significance is gleaned from personal life events. With a timeline in place, a user can zoom in on details of the timeline around landmarks with a “volume control” or search across the full body of information.

Sounds like the start towards a “personal” topic map authoring application.

One important detail: With MS Lifebrowser the user is gathering information on themselves.

Not the same as having Google or FaceBook gathering information on you. Is it?

Printer Dots, Pervasive Tracking and the Transparent Society

Saturday, October 29th, 2011

Printer Dots, Pervasive Tracking and the Transparent Society

From the post:

So far in the fingerprinting series, we’ve seen how a variety of objects and physical devices [1, 2, 3, 4], often even supposedly identical ones, can be uniquely fingerprinted. This article is non-technical; it is an opinion on some philosophical questions about tracking and surveillance.

Here’s a fascinating example of tracking that’s all around you but that you’re probably unaware of:

Color laser printers and photocopiers print small yellow dots on every page for tracking purposes.

My source for this is the EFF’s Seth Schoen, who has made his presentation on the subject available.

If you are tracking the provenance of data in your topic map, does that mean that you are also tracking the users who submitted it?

And is that tracking “transparent” to the users who are being tracked or only “transparent” to the aggregators of that submitted content?

Or is that tracking only “transparent” to the sysops who are working one level above the aggregators?

And at any level, what techniques would you suggest for tracking data, whether transparent or not?

For that matter, what techniques would you suggest for detecting tracking?

Ironic that “transparent” government may require that some (all?) of its citizens and their transactions be “transparent” as well. How else to track the web of influence of lobbyists of various sorts if their transactions are not “transparent?” Which then means their employers will need to be “transparent.” Along with the objects of their largesse and attention. And the web of actors just keeps spreading out.