Subscribing to Big Data at Scale

September 10, 2020 · Declared Dead · 🏛 Distributed and parallel databases

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Xikui Wang, Michael J. Carey, Vassilis J. Tsotras
arXiv ID: 2009.04611
Category: cs.DB (Databases)
Cross-listed: cs.DC
Citations: 5
Venue: Distributed and parallel databases
Last Checked: 4 months ago

Abstract

Today, data is being actively generated by a variety of devices, services, and applications. Such data is important not only for the information that it contains, but also for its relationships to other data and to interested users. Most existing Big Data systems focus on passively answering queries from users, rather than actively collecting data, processing it, and serving it to users. To satisfy both passive and active requests at scale, users need either to heavily customize an existing passive Big Data system or to glue multiple systems together. Either choice would require significant effort from users and incur additional overhead. In this paper, we present the BAD (Big Active Data) system, which is designed to preserve the merits of passive Big Data systems and introduce new features for actively serving Big Data to users at scale. We show the design and implementation of the BAD system, demonstrate how BAD facilitates providing both passive and active data services, investigate the BAD system's performance at scale, and illustrate the complexities that would result from instead providing BAD-like services with a "glued" system.

📜 Similar Papers

In the same crypt — Databases

R.I.P. 👻 Ghosted

The Case for Learned Index Structures

Tim Kraska, Alex Beutel, ... (+3 more)

cs.DB 🏛 SIGMOD 📚 1.2K cites 8 years ago

R.I.P. 👻 Ghosted

Untangling Blockchain: A Data Processing View of Blockchain Systems

Tien Tuan Anh Dinh, Rui Liu, ... (+4 more)

cs.DB 🏛 IEEE TKDE 📚 997 cites 8 years ago

R.I.P. 👻 Ghosted

Converting Static Image Datasets to Spiking Neuromorphic Datasets Using Saccades

Garrick Orchard, Ajinkya Jayawant, ... (+2 more)

cs.DB 🏛 Frontiers in Neuroscience 📚 905 cites 10 years ago

R.I.P. 👻 Ghosted

BLOCKBENCH: A Framework for Analyzing Private Blockchains

Tien Tuan Anh Dinh, Ji Wang, ... (+4 more)

cs.DB 🏛 SIGMOD 📚 872 cites 9 years ago

R.I.P. 👻 Ghosted

Data Synthesis based on Generative Adversarial Networks

Noseong Park, Mahmoud Mohammadi, ... (+4 more)

cs.DB 🏛 VLDB 📚 568 cites 8 years ago

R.I.P. 👻 Ghosted

HoloClean: Holistic Data Repairs with Probabilistic Inference

Theodoros Rekatsinas, Xu Chu, ... (+2 more)

cs.DB 🏛 VLDB 📚 544 cites 9 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago