Timezone: »
Efficient Location Sampling Algorithms for Road Networks
Sara Ahmadian · Kostas Kollias · Ameya Velingker · Sreenivas Gollapudi · Vivek Kumar · Santhoshini Velusamy
Event URL: https://openreview.net/forum?id=fPcIwNb3qI »
Many geographic information systems applications rely on data provided by user devices in the road network, including traffic monitoring, driving navigation, and road closure detection. The underlying signal is generally collected by sampling locations from user trajectories. The sampling process, though critical for various applications, has not been studied sufficiently in the literature. While the most natural way to sample a trajectory may be to use a frequency based algorithm, e.g., sampling locations every $x$ seconds, such a sampling strategy can be quite wasteful in resources (e.g., server-side processing, user battery) as well as stored user data. In this work, we conduct a horizontal study of various location sampling algorithms (based on frequency, road geography, reservoir sampling, etc.) on the road network of New York City and assess their trade-offs in terms of various metrics of interest, such as the size of the stored data and the induced quality of training for prediction tasks (e.g., predicting speeds).
Many geographic information systems applications rely on data provided by user devices in the road network, including traffic monitoring, driving navigation, and road closure detection. The underlying signal is generally collected by sampling locations from user trajectories. The sampling process, though critical for various applications, has not been studied sufficiently in the literature. While the most natural way to sample a trajectory may be to use a frequency based algorithm, e.g., sampling locations every $x$ seconds, such a sampling strategy can be quite wasteful in resources (e.g., server-side processing, user battery) as well as stored user data. In this work, we conduct a horizontal study of various location sampling algorithms (based on frequency, road geography, reservoir sampling, etc.) on the road network of New York City and assess their trade-offs in terms of various metrics of interest, such as the size of the stored data and the induced quality of training for prediction tasks (e.g., predicting speeds).
Author Information
Sara Ahmadian (Google Research)
Kostas Kollias (Google Research)
Ameya Velingker (Google Research)
Sreenivas Gollapudi (Google Research)
Vivek Kumar (Banaras Hindu University, Dhirubhai Ambani Institute Of Information and Communication Technology)
Santhoshini Velusamy (Harvard University)
More from the Same Authors
-
2023 Poster: Exphormer: Sparse Transformers for Graphs »
Hamed Shirzad · Ameya Velingker · Balaji Venkatachalam · Danica J Sutherland · Ali K Sinop -
2023 Poster: Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix Factorization »
Ameya Velingker · Maximilian Vötsch · David Woodruff · Samson Zhou -
2022 Poster: Congested Bandits: Optimal Routing via Short-term Resets »
Pranjal Awasthi · Kush Bhatia · Sreenivas Gollapudi · Kostas Kollias -
2022 Spotlight: Congested Bandits: Optimal Routing via Short-term Resets »
Pranjal Awasthi · Kush Bhatia · Sreenivas Gollapudi · Kostas Kollias -
2019 Poster: Online Algorithms for Rent-Or-Buy with Expert Advice »
Sreenivas Gollapudi · Debmalya Panigrahi -
2019 Oral: Online Algorithms for Rent-Or-Buy with Expert Advice »
Sreenivas Gollapudi · Debmalya Panigrahi -
2019 Poster: Hiring Under Uncertainty »
Manish Purohit · Sreenivas Gollapudi · Manish Raghavan -
2019 Oral: Hiring Under Uncertainty »
Manish Purohit · Sreenivas Gollapudi · Manish Raghavan