Nate Cardozo, staff attorney for Electronic Frontier Foundation, said that the black boxes track such information as accelerator pedal position, brake pedal position, engine rpms, vehicle speed and acceleration, whether seat belts are connected, whether air bags deployed, and a lot more.This argument is largely nonsense. Why should drivers be entitled to privacy? Are people who drive special, or do they just engage in enough questionable behavior that they need protection? After all, air travelers and rail passengers have to have al of their movements recorded and logged. Even transit users who use monthly (or season, if you are in Europe) passes have their activity recorded. What does the EFF say about transit? From their website (in full):
The foundation's concerns include the fact that there is currently no cap on the amount of data collected and there are no limits on the kind of data that will be gathered, Cardozo said.
"The car manufacturers can use that data at will, including location, which has significant privacy implications," Cardozo said, which led to the scenario of a speed jaunt finding its way into the hands of an auto insurance company.
So public transit users will benefit from making data public, but drivers will be harmed. Sure, schedule data is different from vehicle data, but the data available from transit agencies is partly schedule data but also aggregated rider data, some of which can be traced to individual riders. EFF is right that transit riders deserve better service through innovative use of data. So do drivers! Marginal cost insurance is a good thing, not a bad thing. In addition, black boxes in cars can help assign blame in crashes more accurately so we can stop with the nonsense that a driver killing a pedestrian or cyclist was not the result of criminal driving. More data about usage can--no guarantee here as details matter--improve transport for everybody, including: marginal cost pricing, enhanced safety, lower cost travel for those who impose the least stress on transport systems, better routing and scheduling, stronger commitment to the user pays principle, punitive charges to lunkheads who shouldn't be driving, etc. Lastly, there is scant evidence that people (Americans, anyway) give a hoot about privacy in nearly all cases. How many actually use cash to pay for their EZPass? Put in the black boxes, I say, and make the data public while hiding the user ID. We will all be better off. And it will give researchers like me a lot to do.
Who Controls Data About Public Transportation?
How should city transit authorities treat independent software developers who make use of public schedule data? What approach results in the best experience for their passengers and customers?
Two models appear to be emerging to answer this question. One, typified by New York City'sMTA and Washington, DC's WMATA, sees schedule and related data as valuable intellectual property, to be zealously protected, licensed and monetized. So far, the results of this approach appear to have been bad press, irate passengers, wasted money and stymied innovation.
The other model, typified by San Francisco's SFMTA and Portland's TriMet, holds that encouraging independent developers to make free use of schedule information can both save the city money and foster innovative applications. As SFMTA San Francisco BART's Timothy Moore told Streetsblog: "We've put BART in front of customers in so many places that we wouldn't be able to do on our own. We basically can't envision every beneficial use for this public data and frankly transit agencies in general don't have the vision. We don't have the time, we don't have the resources."
In 2009, we've seen interesting developments in each of these four cities:
In New York City, developer Chris Schoenfeld created StationStops, an iPhone app that provided schedule information for Metro North, NYC's largest commuter rail system. The app ran smoothly until earlier this summer, when NYMTA contacted Schoenfeld to claim ownership of the schedule data and demand $5,000 in advance "royalties" on Schoenfeld's revenue.
Schoenfeld wisely recognized this as nonsense: Copyright law simply does not apply to publicly-available factual information. But when he declined to pay the licensing fees, NYMTA sent a takedown notice to Apple, demanding that StationStops be banned from the iPhone. Apple, of course, complied.
NYMTA's extortionate actions censored a helpful and perfectly legal use of their data. The results have been bad for their reputation and bad for their passengers. Connecticut's Stamford Advocate put it well: the MTA "should just leave (Schoenfeld) alone and let him make an honest buck by providing a useful service."
In Washington, DC, the Washington Metropolitan Area Transit Authority (WMATA) seems to be working hard to learn exactly the wrong lessons from NYC's example. After an online petition drive by DC transit activists, WMATA reluctantly opened their data to developers earlier this year. But they also allocated $500,000 (yes, that's five hundred thousand dollars,) for a study which they say "will give us a firm idea as to the commercial value of intellectual property like scheduling information."
We'll save them the trouble: While it's possible they may be able to wrench some value from their trademarks (even though this tactic, too, has backfired embarassingly for NYMTA,) there is no economic value in their schedule information. Any attempt to restrict others' use of this data is baseless and counterproductive. They've already opened their schedule data — if they're smart, they'll keep it that way.
Here in San Francisco, the SF Municipal Transportation Agency (SFMTA) has made great strides towards a first-rate open transit data system, and is setting an example that other transit authorities should aspire to. Schedule data has long been available from the SFMTA in the excellent Google Transit Feed Specification format. And websites like SFMTA Labs and theBART Developer Center encourage and help developers to make use of the data.
However, this silver cloud does have a dark lining: While SFMTA itself has refrained from sending baseless takedown notices, a corporation called NextBus Information Services (NBIS)hasn't been so wise. In 2008, developer Steven Peterson created an iPhone application calledRoutesy, which provides passengers with real-time updates of bus and train locations and arrival times. Then, last month, NBIS contacted Peterson, claimed ownership of the real-time arrival data, and demanded that Routesy be discontinued. When Peterson refused, NBIS asked Apple to ban Routesy from the iTunes App Store. Apple, of course, complied.
NBIS, like the NYC MTA, appears guilty of copyfraud. They've been unable to produce any proof that they do, in fact, own the data in question. SFMTA, to their credit, quickly clarified the situation, telling SFAppeal.com that "Muni owns the data in question and that the public is, of course, entitled to access it." Thanks in part to that statement, Peterson's lawyer was finally able to persuade Apple to restore Routesy to the iTunes App Store. (Though similar skirmishes with NBIS appear to be occuring in other cities.)
Finally, in Portland, Oregon, TriMet was one of the earliest transit authorities in the US to adopt an open data program and encourage independent developers. The result is a healthy and competitive application market that speaks for itself: Over 25 different mobile applicationsfrom different developers make creative use of the data. And, the open data program enabled Portland Airport to display real-time train arrival information at their baggage claims — with no additional work required on TriMet's part. TriMet's Bibiana McHugh explains: "Before, we would have needed to work with a technical team for the airport to make this happen, but with developer.trimet.org, we just make the information available once and our work is done."
If other government data-sets are any indication, the transit apps we've seen so far are just the beginning of what's possible. Just take a look at the impressive winners of Sunlight Foundation's Apps For America contest.
For reasons both legal and practical, transit authorities should follow the lead of SFMTA, TriMet and the Obama Administration's Data.gov, and allow independent developers to freely use their data. The results so far have been a better deal for passengers and taxpayers alike.