| 51 | Data Cleaning for Accurate, Fair, and Robust Models: A Big Data - AI Integration Approach<br>Ki Hyun Tae, Yuji Roh, ... (+3 more) | 👻 Ghosted | cs.DB | 77 | 7 years ago |
| 52 | Join Processing for Graph Patterns: An Old Dog with New Tricks<br>Dung Nguyen, Molham Aref, ... (+5 more) | 👻 Ghosted | cs.DB | 76 | 11 years ago |
| 53 | A Layered Aggregate Engine for Analytics Workloads<br>Maximilian Schleich, Dan Olteanu, ... (+3 more) | 👻 Ghosted | cs.DB | 75 | 6 years ago |
| 54 | Database Learning: Toward a Database that Becomes Smarter Every Time<br>Yongjoo Park, Ahmad Shahab Tajik, ... (+2 more) | 👻 Ghosted | cs.DB | 75 | 9 years ago |
| 55 | Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets<br>Fernando Chirigati, Harish Doraiswamy, ... (+2 more) | 👻 Ghosted | cs.DB | 74 | 9 years ago |
| 56 | Distance-generalized Core Decomposition<br>Francesco Bonchi, Arijit Khan, Lorenzo Severini | 👻 Ghosted | cs.DS | 73 | 7 years ago |
| 57 | Caffe con Troll: Shallow Ideas to Speed Up Deep Learning<br>Stefan Hadjis, Firas Abuzaid, ... (+2 more) | 👻 Ghosted | cs.LG | 73 | 11 years ago |
| 58 | Controlling False Discoveries During Interactive Data Exploration<br>Zheguang Zhao, Lorenzo De Stefani, ... (+4 more) | 👻 Ghosted | cs.DB | 72 | 9 years ago |
| 59 | BlindFL: Vertical Federated Machine Learning without Peeking into Your Data<br>Fangcheng Fu, Huanran Xue, ... (+3 more) | 👻 Ghosted | cs.LG | 72 | 3 years ago |
| 60 | AliCoCo: Alibaba E-commerce Cognitive Concept Net<br>Xusheng Luo, Luxin Liu, ... (+7 more) | 👻 Ghosted | cs.IR | 71 | 6 years ago |
| 61 | Efficient Exact Algorithms for Maximum Balanced Biclique Search in Bipartite Graphs<br>Lu Chen, Chengfei Liu, ... (+3 more) | 👻 Ghosted | cs.DS | 71 | 5 years ago |
| 62 | Apache Hive: From MapReduce to Enterprise-grade Big Data Warehousing<br>Jesús Camacho-Rodríguez, Ashutosh Chauhan, ... (+19 more) | 👻 Ghosted | cs.DB | 71 | 7 years ago |
| 63 | Computing Optimal Repairs for Functional Dependencies<br>Ester Livshits, Benny Kimelfeld, Sudeepa Roy | 👻 Ghosted | cs.DB | 71 | 8 years ago |
| 64 | A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs<br>Elias Stehle, Hans-Arno Jacobsen | 👻 Ghosted | cs.DB | 71 | 9 years ago |
| 65 | IDEBench: A Benchmark for Interactive Data Exploration<br>Philipp Eichmann, Carsten Binnig, ... (+2 more) | 👻 Ghosted | cs.DB | 69 | 8 years ago |
| 66 | Design Principles for Scaling Multi-core OLTP Under High Contention<br>Kun Ren, Jose M. Faleiro, Daniel J. Abadi | 👻 Ghosted | cs.DB | 69 | 10 years ago |
| 67 | Croissant: A Metadata Format for ML-Ready Datasets<br>Mubashara Akhtar, Omar Benjelloun, ... (+29 more) | 👻 Ghosted | cs.LG | 68 | 2 years ago |
| 68 | Theoretically-Efficient and Practical Parallel DBSCAN<br>Yiqiu Wang, Yan Gu, Julian Shun | 👻 Ghosted | cs.DS | 67 | 6 years ago |
| 69 | AC/DC: In-Database Learning Thunderstruck<br>Mahmoud Abo Khamis, Hung Q. Ngo, ... (+3 more) | 👻 Ghosted | cs.DB | 66 | 8 years ago |
| 70 | Scaling Strongly Consistent Replication<br>Aleksey Charapko, Ailidani Ailijiang, Murat Demirbas | 👻 Ghosted | cs.DC | 65 | 6 years ago |
| 71 | A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching<br>Venkata Vamsikrishna Meduri, Lucian Popa, ... (+2 more) | 👻 Ghosted | cs.DB | 64 | 6 years ago |
| 72 | Shortest Paths and Distances with Differential Privacy<br>Adam Sealfon | 👻 Ghosted | cs.CR | 64 | 10 years ago |
| 73 | SLiMFast: Guaranteed Results for Data Fusion and Source Reliability<br>Manas Joglekar, Theodoros Rekatsinas, ... (+3 more) | 👻 Ghosted | cs.DB | 63 | 10 years ago |
| 74 | Black or White? How to Develop an AutoTuner for Memory-based Analytics [Extended Version]<br>Mayuresh Kunjir, Shivnath Babu | 👻 Ghosted | cs.DC | 62 | 6 years ago |
| 75 | Ektelo: A Framework for Defining Differentially-Private Computations<br>Dan Zhang, Ryan McKenna, ... (+5 more) | 👻 Ghosted | cs.DB | 62 | 7 years ago |
| 76 | BPTree: an $\ell_2$ heavy hitters algorithm using constant memory<br>Vladimir Braverman, Stephen R. Chestnut, ... (+4 more) | 👻 Ghosted | cs.DS | 62 | 10 years ago |
| 77 | LaraDB: A Minimalist Kernel for Linear and Relational Algebra Computation<br>Dylan Hutchison, Bill Howe, Dan Suciu | 👻 Ghosted | cs.DB | 61 | 9 years ago |
| 78 | FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis<br>Chao Zhang, Yuren Mao, ... (+6 more) | 👻 Ghosted | cs.CL | 60 | 2 years ago |
| 79 | Cheetah: Accelerating Database Queries with Switch Pruning<br>Muhammad Tirmazi, Ran Ben Basat, ... (+2 more) | 👻 Ghosted | cs.DB | 60 | 6 years ago |
| 80 | Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations (Extended Version)<br>Yingjun Wu, Jia Yu, ... (+3 more) | 👻 Ghosted | cs.DB | 60 | 7 years ago |
| 81 | Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications<br>Maaz Bin Safeer Ahmad, Alvin Cheung | 👻 Ghosted | cs.DB | 58 | 8 years ago |
| 82 | From Group Recommendations to Group Formation<br>Senjuti Basu Roy, Laks V. S. Lakshmanan, Rui Liu | 👻 Ghosted | cs.IR | 57 | 11 years ago |
| 83 | Regular Path Query Evaluation on Streaming Graphs<br>Anil Pacaci, Angela Bonifati, M. Tamer Özsu | 👻 Ghosted | cs.DB | 56 | 6 years ago |
| 84 | Navigating the Data Lake with Datamaran: Automatically Extracting Structure from Log Datasets<br>Yihan Gao, Silu Huang, Aditya Parameswaran | 👻 Ghosted | cs.DB | 55 | 8 years ago |
| 85 | Locating a Small Cluster Privately<br>Kobbi Nissim, Uri Stemmer, Salil Vadhan | 👻 Ghosted | cs.DS | 55 | 10 years ago |
| 86 | The Machine Learning Bazaar: Harnessing the ML Ecosystem for Effective System Development<br>Micah J. Smith, Carles Sala, ... (+2 more) | 👻 Ghosted | cs.SE | 54 | 6 years ago |
| 87 | Answering (Unions of) Conjunctive Queries using Random Access and Random-Order Enumeration<br>Nofar Carmeli, Shai Zeevi, ... (+3 more) | 👻 Ghosted | cs.DB | 54 | 6 years ago |
| 88 | One SQL to Rule Them All<br>Edmon Begoli, Tyler Akidau, ... (+4 more) | 👻 Ghosted | cs.DB | 53 | 6 years ago |
| 89 | RisGraph: A Real-Time Streaming System for Evolving Graphs to Support Sub-millisecond Per-update Analysis at Millions Ops/s<br>Guanyu Feng, Zixuan Ma, ... (+5 more) | 👻 Ghosted | cs.DB | 52 | 6 years ago |
| 90 | Management of Machine Learning Lifecycle Artifacts: A Survey<br>Marius Schlegel, Kai-Uwe Sattler | 👻 Ghosted | cs.DB | 52 | 3 years ago |
| 91 | Causal Relational Learning<br>Babak Salimi, Harsh Parikh, ... (+4 more) | 👻 Ghosted | cs.DB | 51 | 6 years ago |
| 92 | BriskStream: Scaling Data Stream Processing on Shared-Memory Multicore Architectures<br>Shuhao Zhang, Jiong He, ... (+2 more) | 👻 Ghosted | cs.DB | 51 | 7 years ago |
| 93 | Causal Feature Selection for Algorithmic Fairness<br>Sainyam Galhotra, Karthikeyan Shanmugam, ... (+2 more) | 👻 Ghosted | cs.LG | 50 | 5 years ago |
| 94 | Computing Join Queries with Functional Dependencies<br>Mahmoud Abo Khamis, Hung Q. Ngo, Dan Suciu | 👻 Ghosted | cs.DB | 50 | 10 years ago |
| 95 | SLING: A Near-Optimal Index Structure for SimRank<br>Boyu Tian, Xiaokui Xiao | 👻 Ghosted | cs.DB | 50 | 10 years ago |
| 96 | Joining Extractions of Regular Expressions<br>Dominik D. Freydenberger, Benny Kimelfeld, Liat Peterfreund | 👻 Ghosted | cs.DB | 49 | 9 years ago |
| 97 | Verification of Hierarchical Artifact Systems<br>Alin Deutsch, Yuliang Li, Victor Vianu | 👻 Ghosted | cs.DB | 49 | 10 years ago |
| 98 | On the Complexity of Inner Product Similarity Join<br>Thomas D. Ahle, Rasmus Pagh, ... (+2 more) | 👻 Ghosted | cs.DS | 49 | 10 years ago |
| 99 | Complaint-driven Training Data Debugging for Query 2.0<br>Weiyuan Wu, Lampros Flokas, ... (+2 more) | 👻 Ghosted | cs.DB | 48 | 6 years ago |
| 100 | A Cost-based Optimizer for Gradient Descent Optimization<br>Zoi Kaoudi, Jorge-Arnulfo Quiané-Ruiz, ... (+3 more) | 👻 Ghosted | cs.DB | 48 | 9 years ago |