Photon
Citations Over TimeTop 1% of 2013 papers
Abstract
Web-based enterprises process events generated by millions of users interacting with their websites. Rich statistical data distilled from combining such interactions in near real-time generates enormous business value. In this paper, we describe the architecture of Photon, a geographically distributed system for joining multiple continuously flowing streams of data in real-time with high scalability and low latency, where the streams may be unordered or delayed. The system fully tolerates infrastructure degradation and datacenter-level outages without any manual intervention. Photon guarantees that there will be no duplicates in the joined output (at-most-once semantics) at any point in time, that most joinable events will be present in the output in real-time (near-exact semantics), and exactly-once semantics eventually.
Related Papers
- → Teacher-Student Learning for Low-Latency Online Speech Enhancement Using Wave-U-Net(2021)23 cited
- → Challenges in Mining Big Data Streams(2018)6 cited
- → Identifying the Challenges in Reducing Latency in GSN using Predictors(2009)2 cited
- → An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR(2021)