The 2008 Transit Data Are Here!
posted in Mission, Transportation, Useful Data |The Antiplanner is a data junkie. When the Federal Transit Administration publishes each year’s National Transit Database, I feel like running down the street like Steve Martin in The Jerk: “The 2008 Transit Database is here! And I’m in it!” (I did, after all, ride the Washington Metro more than a few times in 2008.)

As in previous years, the 2008 database comes in two very different formats: the database and the data tables. Both are self-extracting exe files (if, like the Antiplanner, you have a Macintosh, you may need a special program such as File Juicer to extract the data). Each has more-or-less the same data, but the data tables are easier to read while the database is easier to manipulate in an Excel spreadsheet.
The Federal Transit Administration posted the 2008 transit data about a week ago. Ridership, operating costs, capital costs, and other information are contained in separate files. To make them easier to use, I’ve put the data I most often access in one file (2.4 mb). This adds to my collection of similar files for 2007, 2006, and 2005.
All these annual files have the same basic format. This year, the first 1480 rows list every mode of transit provided by every transit agency that reported to the FTA. Then there are about 14 rows that summarize the data by mode — light rail, heavy rail, motor buses, trolley buses, etc. Because energy and carbon dioxide data are not available for all transit systems, I repeated those 14 rows summarizing just those systems for which energy data are available. Finally, rows 1520 through 1869 summarize the data for each of 350 urbanized areas.
The data columns in all three parts of the spreadsheet are the same: trips, passenger miles (PM), vehicle revenue miles (VRM), vehicle revenue hours (VRH), operating costs, capital costs, fares, vehicles, seats, standing room, average number of seats per vehicle, average standing room per vehicle, BTUs of energy consumption, pounds of CO2 emissions, and directional route miles of rail transit lines (DRM).
Most of these data are taken directly from the database. The average numbers of seats and standing room are calculated by dividing total seats and standing room by numbers of vehicles. BTUs of energy and CO2 emissions are calculated by multiplying the gallons, kilowatt hours, or other fuel volumes by standard factors, most of which are published by the Energy Information Agency. The BTUs required to produce a kilowatt hour are from table B.6 of the Transportation Energy Data Base, and the pounds of CO2 per kilowatt hour vary by the mix of electrical energy produced in each state.
The remaining columns are all calculations based on the preceding columns: BTUs and CO2 per passenger mile, operating costs, capital costs, and fares per trip and per passenger mile, the percentage of seats filled and the percentage of capacity filled. I’ll probably add more calculations later and may post an update then.
To better understand the data, I added a new mode of transit: vintage trolleys. The FTA classifies vintage trollies and streetcars as light rail, but they are really very different. I classified the Tacoma streetcar as a vintage trolley even though it makes no pretense of being vintage. The data for the Portland streetcar are mixed with Portland’s light rail.
I also adjusted the urbanized areas for some commuter-rail lines. Maryland transportation officials tell me that 90 percent of their commuter-rail customers go to DC, not Baltimore, so I put Maryland commuter rail in the DC urban area. Similarly, I put Connecticut commuter trains in New York, the Portland, Maine commuter rail in Boston, the Altamont Commuter Express in San Jose, and the planned Delaware commuter rail in Philadelphia. New Jersey has three light-rail lines, two in the New York urban area and one in Philadelphia, but the data are not separated so I grouped them all in New York.
A few early observations:
First, thanks to the increased ridership due to high gas prices, most forms of transit were about 1 to 2 percent more energy efficient in 2008 than in 2007. An exception was the demand-responsive buses (paratransit), but this appears to be more because more transit agencies reported the fuel used for such routes in 2008 than in 2007. Interestingly, the most energy-efficient forms of transit are vanpools and the Puerto Rican públicos or public cars, a private transit service that is almost entirely unsubsidized (the only subsidy seems to be to the Puerto Rican government agency that monitors the public cars).
Second, Denver still has the emptiest light-rail cars in the nation, carrying an average of 14 riders compared with a national average of 24. San Jose is second with just 16 riders per car. I don’t know much about the North (San Diego) County Transit District; it claims to have the fullest light-rail cars, with 48 riders, but it also admits to having by far the heaviest subsidies per trip. This seems peculiar.
Third, there are some crazy commuter-rail lines and proposals out there. Commuter rail seems to be the poor city’s light rail (which in turn is the middle city’s heavy rail): cities build it because it uses existing rail lines and seems cheap, but compared with buses it is ridiculous.
Nashville, for example, spent just $33 million to start running commuter trains on a 31-mile line shared with freight trains. But the “Music City Star” carried an average of just 264 commuters to and from work on weekdays in 2008, and taxpayers had to pay more than $3 per passenger mile subsidizing train operations (compared with less than $1 for Nashville buses and 50 cents for transit buses nationwide).
Portland’s commuter rail is too recent for the 2008 data base, but in September it carried an average of just 563 commuters to work each weekday. Commuter-rail lines in Seattle, Dallas, and Miami-Ft. Lauderdale aren’t much better; Seattle and Dallas both lose more than $2 per passenger mile on operations. Operating the Ft. Lauderdale trains only loses about $1 per passenger mile, but they recently spent hundreds of millions of dollars double-tracking the route and have only a few new passengers to show for it. Proposed lines in Akron, Atlanta, Denver, Raleigh, Salt Lake City, and elsewhere are not likely to do much better than these.
In any case, I hope those of you who are also data junkies find the spreadsheet useful.




posted on October 28th, 2009 at 9:53 am
posted on October 28th, 2009 at 10:43 am
posted on October 28th, 2009 at 10:44 am
posted on October 28th, 2009 at 11:05 am
posted on October 28th, 2009 at 1:36 pm
posted on October 28th, 2009 at 5:39 pm
posted on October 29th, 2009 at 1:23 am
posted on October 29th, 2009 at 12:55 pm
posted on October 30th, 2009 at 3:17 am
posted on October 31st, 2009 at 10:44 am
posted on November 2nd, 2009 at 12:01 am