| Field | Value | Language |
| dc.contributor.author | Chin, Teck Kean | |
| dc.contributor.author | Xian, Tingsen (Tim) | |
| dc.contributor.author | Marks, Benjy | |
| dc.contributor.author | Nelson, John D. | |
| dc.contributor.author | Moylan, Emily | |
| dc.coverage.spatial | New South Wales, Australia | en |
| dc.coverage.temporal | June 2020 to June 2022 | en |
| dc.date.accessioned | 2022-09-19T04:00:24Z | |
| dc.date.available | 2022-09-19T04:00:24Z | |
| dc.date.issued | 2022-09-19 | |
| dc.identifier.uri | https://hdl.handle.net/2123/29562 | |
| dc.description.abstract | Cities generate large volumes of data daily through digital services and smart city applications, these include Public Transport Authorities which generate big data as part of their daily operations, such as vehicle positions, counts of passengers and user travel patterns. The General Transit Feed Specification (GTFS) is a data format that allows public transport data to be consumed by a wide variety of software applications. This paper presents a data pipeline developed to manipulate the GTFS feeds into a general and flexible dataset of realtime transit arrivals. There are three barriers to widespread access to the information addressed by creating a one-size-fits-all data pipeline for realtime operations from GTFS. First, the protocol buffer format is not human readable and requires processing before use in most transport applications. Secondly, the general specification does vary place-to-place and the conditionally required and optional fields are inconsistent between locations. Thirdly, the raw data may contain errors including missing stop sequence or a reverse direction bus being detected in the bus stop area. The pipeline is constructed of set of data cleaning and transformation steps to address these challenges. The paper briefly presents a potential use cases of the processed data to illustrate its relevance to researchers and practitioners. | en |
| dc.language.iso | en | en |
| dc.rights | Creative Commons Attribution 4.0 | en |
| dc.subject | Big Data | en |
| dc.subject | Open Data | en |
| dc.subject | General Transit Feed Specification (GTFS) | en |
| dc.subject | GTFS-S | en |
| dc.subject | GTFS-R | en |
| dc.subject | Public Transport | en |
| dc.subject | Transport Performance | en |
| dc.title | Data pipeline for GTFS transit arrival and departure information | en |
| dc.type | Dataset | en |
| dc.subject.asrc | 08 Information and Computing Sciences | en |
| dc.subject.asrc | 0804 Data Format | en |
| dc.subject.asrc | 0905 Civil Engineering | en |
| dc.subject.asrc | 10 Technology | en |
| dc.subject.asrc | 1205 Urban and Regional Planning | en |
| dc.identifier.doi | 10.25910/1pfb-4z05 | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/13_Cleaned_Daily_TU.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2020_06.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2020_07.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2020_08.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2020_09.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2020_10.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2020_11.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2020_12.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_01.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_02.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_03.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_04.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_05.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_06.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_07.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_08.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_09.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_10.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_11.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2021_12.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2022_01.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2022_02.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2022_03.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2022_04.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2022_05.7z | |
| dc.bitstream.url | https://ses-data.library.sydney.edu.au/public/29562_Chin/Transformed_TU_2022_06.7z | |
| usyd.faculty | SeS faculties schools::The University of Sydney Business School::Institute of Transport and Logistics Studies (ITLS) | en |
| workflow.metadata.only | No | en |