Program Overview
The program overview can be accesed here.
Proceedings
SIGMOD 2024 papers can be accessed here.
PODS 2024 papers can be accessed here.
Conference Program: Sessions
- Opening Remarks
- Award Talks
- Keynotes
- Research Sessions
- Industry Sessions
- Demo Sessions
- Panel Discussions
- Posters
- Tutorials
- New Researcher Symposium
- Sponsors
- DEI
- PODS Special Event
OPENING REMARKS (INCLUDING 2 AWARD TALKS)
Tuesday June 11 8:30 am – 9:30 am
Location: Los Volcanes
- Welcome by General Chairs
- Welcome by Program Chairs
- ACM SIGMOD Doctoral Dissertation Awards:
- Daniel Kang (University of California, Los Angeles)
- ACM SIGMOD Systems Award:
- Apache SINGA: A Distributed Deep Learning System
AWARD TALKS
Wednesday June 12 8:30 am – 9:30 am
Location: Los Volcanes
- Research highlights Award
- Artifacts and Reproducibility Award:
- InfiniFilter: Expanding Filters to Infinity and Beyond. Niv Dayan, Ioana Bercea, Pedro Reviriego, and Rasmus Pagh
- Programming contest:
- Alaya (Southern University of Science and Technology, Zhejiang University)
- Distinguished AEs/PC members
- SIGMOD Research/Industry papers
- SIGMOD Research Implementation Strategies for Views over Property Graphs. Soonbo Han (University of Pennsylvania) and Zack Ives (University of Pennsylvania)
- SIGMOD Industry PolarDB-MP: A Multi-Primary Cloud-Native Database via Disaggregated Shared Memory. Xinjun Yang (Alibaba Group); Yingqiang Zhang (Alibaba Group); Hao Chen (Alibaba Group); Feifei Li (Alibaba Group); Bo Wang (Alibaba Group); Jing Fang (Alibaba Group); Chuan Sun (Alibaba Group); Yuhui Wang (Alibaba Group)
- PODS best paper:
- Consistency of Relations over Monoids Albert Atserias (Universitat Politècnica de Catalunya) and Phokion Kolaitis (UC Santa Cruz & IBM Research)
- History-Independent Dynamic Partitioning: Operation-Order Privacy in Ordered Data Structures Michael A. Bender (Stony Brook University), Martin Farach-Colton (New York University), Michael T. Goodrich (UC Irvine) and Hanna Komlós (New York University)
- Mendelzon PODS Test of Time Award:
- Composable Core-sets for Diversity and Coverage Maximization. Piotr Indyk (MIT), Sepideh Mahabadi (TTIC), Mohammad Mahdian (Google) and Vahab S. Mirrokni (Google)
Thursday June 13 8:30 am – 9:30 am
Location: Los Volcanes
- Best Demonstration Award
- SIGMOD Test of Time Award:
- PrivBayes: Private Data Release via Bayesian Networks. Jun Zhang, Graham Cormode, Cecilia M. Procopiuc, Divesh Srivastava, Xiaokui Xiao
- SIGMOD Contributions Award:
- Sihem Amer-Yahia (CNRS, Univ. Grenoble Alpes)
- CODD Innovations Award:
- Samuel Madden (MIT)
KEYNOTES
The Limitations of Data, ML & Us
Ricardo Baeza Yates,
Institute for Experiential AI, Northeastern University & DCC, Universidad de Chile
Tuesday June 11 9:30 am – 10:30 am
Location: Los Volcanes
Session Chair: Pablo Barceló
The Journey to a Knowledgeable Assistant with Retrieval-Augmented Generation (RAG)
Xin Luna Dong, Meta Reality Labs
Wednesday June 12 9:30 am – 10:30 am
Location: Los Volcanes
Session Chair: Alexandra Meliou
Making Data Management Better with Vectorized Query Processing
Peter Boncz, CWI Amsterdam and MotherDuck
Thursday June 13 9:30 am – 10:30 am
Location: Los Volcanes
Session Chair: S. Sudarshan
RESEARCH SESSIONS
Session 1: Storage 1
Tuesday June 11 1:00 pm – 2:30 pm
Location: Tupungato
Session Chair: Philippe Bonnet
-
OptiQL: Robust Optimistic Locking for Memory-Optimized Indexes
Ge Shi (Simon Fraser University)*; Ziyi Yan (Simon Fraser University); Tianzheng Wang (Simon Fraser University) -
ALP: Adaptive Lossless floating-Point Compression
Azim Afroozeh (CWI)*; Leonardo X Kuffo (CWI); Peter Boncz (Centrum Wiskunde & Informatica) -
Structural Designs Meet Optimality: Exploring Optimized LSM-tree Structures in A Colossal Configuration Space
Junfeng Liu (Nanyang Technological University)*; Fan Wang (Nanyang Technological University); Dingheng Mo (Nanyang Technological University); Siqiang Luo (Nanyang Technological University) -
GTS: GPU-based Tree Index for Fast Similarity Search
Yifan Zhu (Zhejiang University); Ruiyao Ma (Zhejiang University); Baihua Zheng (Singapore Management University); Xiangyu Ke (Zhejiang University, China); Lu Chen (Zhejiang University); Yunjun Gao (Zhejiang University)* -
CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure
Qiaolin Yu (Cornell University)*; Chang Guo (Arizona State University); Jay Zhuang (Independent Researcher); Viraj Thakkar (Arizona State University); Jianguo Wang (Purdue University); Zhichao Cao (Arizona State University) -
AirIndex: Versatile Index Tuning Through Data and Storage
Supawit Chockchowwat (University of Illinois at Urbana-Champaign)*; Wenjie Liu (University of Illinois at Urbana-Champaign); Yongjoo Park (University of Illinois at Urbana-Champaign)
Session 2: AI and ML in Databases (1)
Tuesday June 11 1:00 pm – 2:30 pm
Location: Llaima
Session Chair: Nick Koudas
-
Language-Model Based Informed Partition of Databases to Speed Up Pattern Mining
Carlos Bobed Lisbona (University of Zaragoza); Jordi Bernad (University of Zaragoza)*; Pierre Maillot (INRIA) -
Robustness of Updatable Learning-based Index Advisors
Yihang Zheng (Xiamen University); Chen Lin (Xiamen University)*; xian Lyu (Xiamen University); Xuanhe Zhou (Tsinghua); Guoliang Li (Tsinghua University); Tianqing Wang (Huawei) -
Settling Time vs. Accuracy Tradeoffs for Clustering Big Data
Andrew Draganov (Aarhus University)*; David Saulpic (ISTA); Chris Schwiegelshohn (Aarhus University) -
PACE: Poisoning Attacks on Learned Cardinality Estimation
Jintao Zhang (Tsinghua University); Guoliang Li (Tsinghua University)*; Chao Zhang (Tsinghua University); Chengliang Chai (Beijing Institute of Technology) -
Learning-based Property Estimation with Polynomials
Jiajun Li (Renmin University of China)*; Runlin Lei (Renmin University of China); Sibo Wang (The Chinese University of Hong Kong); Zhewei Wei (Renmin University of China); Bolin Ding ("Data Analytics and Intelligence Lab, Alibaba Group") -
Machine Unlearning in Learned Databases: An Experimental Analysis
Meghdad Kurmanji (University of Warwick)*; Eleni Triantafillou (Google); Peter Triantafillou (University of Warwick)
Session 3: Data Quality (1)
Tuesday June 11 1:00 pm – 2:30 pm
Location: Puyehue/Calbuco
Session Chair: Ziawasch Abedjan
-
SAGA: A Scalable Framework for Optimizing Data Cleaning Pipelines for Machine Learning Applications
Shafaq Siddiqi (Graz University of Technology)*; Roman Kern (Graz University of Technology); Matthias Boehm (Technische Universität Berlin) -
FedCSS: Joint Client-and-Sample Selection for Hard Sample-Aware Noise-Robust Federated Learning
Anran Li (Nanyang Technological University)*; Yue Cao (Nanyang Technological University); Jiabao Guo (Wuhan University); Hongyi Peng (Nanyang Technological University ); Qing Guo (A*STAR); Han Yu (Nanyang Technological University (NTU)) -
Splitting Tuples of Mismatched Entities
Wenfei Fan (Univ. of Edinburgh ); Ziyan Han (Beihang University); Weilong Ren (Shenzhen Institute of Computing Sciences)*; ding wang (Center of Inernet Governance Research of Tsinghua university); Yaoshu Wang (Shenzhen Institute of Computing Sciences, Shenzhen University); Min Xie (Shenzhen Institute of Computing Sciences); Mengyi Yan (Beihang University) -
Certain and Approximately Certain Models for Statistical Learning
Cheng Zhen (Oregon State University)*; Nischal Aryal (Oregon State University); Arash Termehchy (Oregon State University); Amandeep Singh Chabada (Oregon State University) -
In-Database Data Imputation
Massimo Perini (The University of Edinburgh); Milos Nikolic (University of Edinburgh)* -
Akane: Perplexity-Guided Time Series Data Cleaning
Xiaoyu Han (Fudan University)*; Haoran Xiong (Fudan University); Zhenying He (Fudan University); Peng Wang (" Fudan University, China"); Chen Wang (" Tsinghua University, China"); X. Sean Wang (Fudan University)
Session 4: Graphs (1)
Tuesday June 11 3:00 pm – 4:30 pm
Location: Parinacota
Session Chair: Yang Cao
-
Fast Maximal Quasi-clique Enumeration: A Pruning and Branching Co-Design Approach
Kaiqiang Yu (Nanyang Technological University)*; Cheng Long (Nanyang Technological University) -
Modularity-based Hypergraph Clustering: Random Hypergraph Model, Hyperedge-cluster Relation, and Computation
Zijin Feng (The Chinese University of Hong Kong)*; Miao Qiao (The University of Auckland); Hong Cheng (Chinese University of Hong Kong) -
Maximum k-Plex Computation: Theory and Practice
Lijun Chang (The University of Sydney)*; Kai Yao (The University of Sydney) -
Efficient Algorithm for Budgeted Adaptive Influence Maximization: An Incremental RR-set Update Approach
Qintian Guo (The Chinese University of Hong Kong); Chen Feng (The Chinese University of Hong Kong); Fangyuan ZHANG (The Chinese University of Hong Kong); Sibo Wang (The Chinese University of Hong Kong)* -
Efficient Maximum k-Defective Clique Computation with Improved Time Complexity
Lijun Chang (The University of Sydney)* -
Scalable Approximate Butterfly and Bi-triangle Counting for Large Bipartite Networks
Fangyuan Zhang (The Chinese University of Hong Kong); Dechuang CHEN (The Chinese University of Hong Kong); Sibo Wang (The Chinese University of Hong Kong)*; Yin Yang (Hamad bin Khalifa University); Junhao Gan (University of Melbourne)
Session 5: Query Processing (1)
Tuesday June 11 3:00 pm – 4:30 pm
Location: Tupungato
Session Chair: Aidan Hogan
-
Optimizing Disjunctive Queries with Tagged Execution
Albert Kim (MIT)*; Samuel Madden (MIT) -
Optimizing Nested Recursive Queries
Amir Shaikhha (University of Edinburgh)*; Dan Suciu (University of Washington); Maximilian Schleich (RelationalAI); Hung Ngo (RelationalAI) -
A Unified Approach for Resilience and Causal Responsibility with Integer Linear Programming (ILP) and LP Relaxations
Neha Makhija (Northeastern University)*; Wolfgang Gatterbauer (Northeastern University) -
Selectivity Estimation for Queries Containing Predicates over Set-Valued Attributes
Zizhong Meng (Nanyang Technological University)*; Xin Cao (University of New South Wales); Gao Cong (Nanyang Technological Univesity) -
Sub-optimal Join Order Identification with L1-error
Yesdaulet Izenov (University of California, Merced)*; Asoke Datta (University of California, Merced); Brian Tsan (UC Merced); Florin Rusu (UC Merced) -
PLAQUE: Automated Predicate Learning at Query Time
Yiming Lin (University of California, Berkeley)*; Sharad Mehrotra (U.C. Irvine)
Session 6: Data Systems for AI and ML (1)
Tuesday June 11 3:00 pm – 4:30 pm
Location: Llaima
Session Chair: Matthias Boehm
-
CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models
Hailin Zhang (Peking University)*; Zirui Liu (Peking University); Boxuan Chen (Peking University); Yikai Zhao (Peking University); Tong Zhao (Peking University); Tong Yang (Peking University); Bin Cui (Peking University) -
The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage Format
Utku Sirin (Harvard University)*; Stratos Idreos (Harvard) -
STile: Searching Hybrid Sparse Formats for Sparse Deep Learning Operators Automatically
Jingzhi Fang (HKUST)*; Yanyan Shen (Shanghai Jiao Tong University); Yue Wang (Shenzhen Institute of Computing Sciences); Lei Chen (Hong Kong University of Science and Technology) -
SIMPLE: Efficient Temporal Graph Neural Network Training at Scale with Dynamic Data Placement
Shihong Gao (The Hong Kong University of Science and Technology)*; Yiming Li (Hong Kong University of Science and Technology); Xin Zhang (Hong Kong University of Science and Technology); Yanyan Shen (Shanghai Jiao Tong University); Yingxia Shao (BUPT); Lei Chen (Hong Kong University of Science and Technology) -
On Efficient Large Sparse Matrix Chain Multiplication
Chunxu Lin (The Chinese University of Hong Kong,Shenzhen)*; Wensheng Luo (School of Data Science, The Chinese University of Hong Kong, Shenzhen); Yixiang Fang (The Chinese University of Hong Kong, Shenzhen); Chenhao Ma (The Chinese University of Hong Kong, Shenzhen); Xilin Liu (Huawei); YUCHI MA (HUAWEI CLOUD) -
FACET: Robust Counterfactual Explanation Analytics
Peter M VanNostrand (WPI)*; Huayi Zhang (WPI); Dennis M Hofmann (Worcester Polytechnic Institute); Elke A Rundensteiner (Worcester Polytechnic Institute)
Session 7: Data Quality (2) + Security (1)
Tuesday June 11 3:00 pm – 4:30 pm
Location: Puyehue/Calbuco
Session Chair: Felix Naumann
-
Missing Data Imputation with Uncertainty-Driven Network
Jianwei Wang (University of New South Wales)*; Ying Zhang (University of Technology Sydney); Kai Wang (Shanghai Jiao Tong University); Xuemin Lin (Shanghai Jiaotong University); Wenjie Zhang (University of New South Wales) -
OTClean: Data Cleaning for Conditional Independence Violations using Optimal Transport
Babak Salimi (University of California at San Diego)*; Mostafa Milani (The University of Western Ontario); Alireza Pirhadi (The University of Western Ontario); Alexander Cloninger (University of California San Diego); Mohammad Hossein Moslemi (University of Western Ontario) -
Towards Metric DBSCAN: Exact, Approximate, and Streaming Algorithms
Mo Guanlin (University of Science and Technology of China); Shihong Song (University of Science and Technology of China); Hu Ding (University of Science and Technology of China)* -
WeBridge: Synthesizing Stored Procedures for Large-Scale Real-World Web Applications
Gansen Hu (Shanghai Jiao Tong University )*; zhaoguo wang (Shanghai Jiao Tong University); Jiahuan Shen (Shanghai Jiao Tong University); Zhiyuan Dong (Shanghai Jiao Tong University); Chuzhe Tang (Shanghai Jiao Tong University); Sheng Yao (Shanghai Jiao Tong University); Haibo Chen (Shanghai Jiao Tong University) -
TEE-based General-purpose Computational Backend for Secure Delegated Data Processing
Mo Sha (Alibaba Group)*; Jialin Li (NUS); Sheng Wang (Alibaba Group); Feifei Li (Alibaba Group); Kian-Lee Tan (National University of Singapore) -
Waffle: An Online Oblivious Datastore for Protecting Data Access Patterns
Sujaya Maiyya (University of Waterloo)*; Sharath Chandra Vemula (UC Santa Barbara); Divy Agrawal (University of California, Santa Barbara); Amr El Abbadi (UC Santa Barbara); Florian Kerschbaum (University of Waterloo)
Session 8: Graphs (2)
Tuesday June 11 5:00 pm – 6:30 pm
Location: Parinacota
Session Chair: Xiaokui Xiao
-
Efficient k-Clique Listing: An Edge-Oriented Branching Strategy
Kaixin Wang (Nanyang Technological University); Kaiqiang Yu (Nanyang Technological University); Cheng Long (Nanyang Technological University)* -
Efficient High-Quality Clustering for Large Bipartite Graphs
Renchi Yang (Hong Kong Baptist University)*; Jieming Shi (The Hong Kong Polytechnic University) -
Efficient Core Maintenance in Large Bipartite Graphs
Wensheng Luo (School of Data Science, The Chinese University of Hong Kong, Shenzhen)*; Qiaoyuan Yang (CUHK-Shenzhen); Yixiang Fang (The Chinese University of Hong Kong, Shenzhen); Xu Zhou (Hunan university) -
Efficient Maximal Biplex Enumerations with Improved Worst-Case Time Guarantee
Qiangqiang Dai (Beijing Institute of Technology); Ronghua Li (Beijing Institute of Technology)*; Donghang Cui (Beijing Institute of Technology); Meihao Liao (Beijing Institute of Technology); Yu-Xuan Qiu (Shenzhen University); Guoren Wang (Beijing Institute of Technology) -
HERO: A Hierarchical Set Partitioning and Join Framework for Speeding up the Set Intersection Over Graphs
Boyu Yang (Fudan University ); Weiguo Zheng (Fudan University)*; Xiang Lian (Kent State University); Yuzheng Cai (Fudan University); X. Sean Wang (Fudan University) -
A Comprehensive Survey and Experimental Study of Subgraph Matching: Trend, Unbiasedness, and Interaction
Zhijie Zhang (Fudan University); Yujie Lu (Fudan University); Weiguo Zheng (Fudan University)*; Xuemin Lin (Shanghai Jiaotong University)
Session 9: Query Processing (2)
Tuesday June 11 5:00 pm – 6:30 pm
Location: Tupungato
Session Chair: Ke Yi
-
ROME: Robust Query Optimization via Parallel Multi-Plan Execution
Ziyun Wei (Cornell University)*; Immanuel Trummer (Cornell University) -
Proving Query Equivalence Using Linear Integer Arithmetic
Haoran Ding (Shanghai Jiao Tong University); zhaoguo wang (Shanghai Jiao Tong University)*; Yicun Yang (Shanghai Jiao Tong University); Dexin Zhang (Shanghai Jiao Tong University); Zhenglin Xu (Shanghai Jiao Tong University); Haibo Chen (Shanghai Jiao Tong University); Ruzica Piskac (Yale University); Jinyang Li (New York University) -
FedKNN: Secure Federated k-Nearest Neighbor Search
Xinyi Zhang (Hong Kong Baptist University); Qichen Wang (Hong Kong Baptist University); Cheng Xu (Hong Kong Baptist University); Yun PENG (Guangzhou University); Jianliang Xu (Hong Kong Baptist University)* -
Relational Algorithms for Top-k Query Evaluation
Qichen Wang (Hong Kong Baptist University)*; Qiyao Luo (Hong Kong University of Science and Technology); Yilei Wang (Alibaba Cloud) -
Efficient Approximation Framework for Attribute Recommendation
Xingguang Chen (The Chinese University of Hong Kong); Fangyuan ZHANG (The Chinese University of Hong Kong); Jinchao Huang (The Chinese University of Hong Kong); Sibo Wang (The Chinese University of Hong Kong)* -
Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data Segment
Mengzhao Wang (Zhejiang University)*; Weizhi Xu (Zilliz); Xiaomeng Yi (Zhejiang Lab); Songlin Wu (Tongji University); Zhangyang Peng (Hangzhou Dianzi University); Xiangyu Ke (Zhejiang University, China); Yunjun Gao (Zhejiang University); Xiaoliang Xu (Hangzhou Dianzi University); Rentong Guo (Zilliz); Charles Xie (Zilliz)
Session 10: Data Exploration (1)
Tuesday June 11 5:00 pm – 6:30 pm
Location: Llaima
Session Chair: Sharad Mehrotra
-
Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study
Yang Wu (Huazhong University of Science and Technology)*; Yao Wan (Huazhong University of Science and Technology); Hongyu Zhang (University of Newcastle); Yulei Sui (UNSW Sydney); wucai wei (Huazhong University of Science and Technology); Wei Zhao (Huazhong University of Science and Technology); Guandong Xu (University of Technology Sydney, Australia); Hai Jin (Huazhong University of Science and Technology) -
Optimizing Dataflow Systems for Scalable Interactive Visualization
Junran Yang (University of Washington)*; Hyekang Kevin Joo (Carnegie Mellon University); Sai Yerramreddy (University of Maryland); Dominik Moritz (Carnegie Mellon University); Leilani Battle (University of Washington) -
Time Series Representation for Visualization in Apache IoTDB
Lei Rui (Tsinghua University); Xiangdong Huang (Tsinghua University); Shaoxu Song (Tsinghua University)*; Yuyuan Kang (University of Wisconsin-Madison); Chen Wang (Timecho Limited); Jianmin Wang ("Tsinghua University, China") -
Dias: Dynamic Rewriting of Pandas Code
Stefanos Baziotis (University of Illinois at Urbana-Champaign)*; Daniel Kang (UIUC); Charith Mendis (University of Illinois at Urbana-Champaign) -
On The Reasonable Effectiveness of Relational Diagrams: Explaining Relational Query Patterns and the Pattern Expressiveness of Relational Languages
Wolfgang Gatterbauer (Northeastern University)*; Cody Dunne (Northeastern University) -
Summarized Causal Explanations For Aggregate Views
Brit Youngmann (Technion - Israel institute of technology)*; Michael Cafarella (MIT CSAIL); Amir Gilad (The Hebrew University); Sudeepa Roy (Duke University, USA)
Session 11: Streams (1)
Tuesday June 11 5:00 pm – 6:30 pm
Location: Puyehue/Calbuco
Session Chair: Steffen Zeuch
-
Memory-Efficient and Flexible Detection of Heavy Hitters in High-Speed Networks
He Huang (Soochow University); Jiakun Yu (Soochow University); Yang Du (Soochow University)*; Jia Liu (Nanjing University); Haipeng Dai (Nanjing University); Yu-E Sun (Soochow University) -
Closest Pairs Search Over Data Stream
Rui Zhu (Shenyang Aerospace University)*; Bin Wang (Northeastern University); Xiaochun Yang (Northeastern University); Baihua Zheng (Singapore Management University) -
PECJ: Stream Window Join on Disorder Data Streams with Proactive Error Compensation
Xianzhi Zeng (Singapore University of Technology and Design); Shuhao Zhang (Nanyang Technological University)*; Hongbin HB Zhong (4paradigm); Hao Zhang (4Paradigm Inc.); mian lu (4Paradigm Inc.); zhao zheng (4Paradigm Inc.); Yuqiang Chen (4th Paradigm) -
Low-Latency Adaptive Distributed Stream Join System Based on a Flexible Join Model
Qihang Wang (Harbin Institute of Technology); Decheng Zuo (Harbin Institute of Technology); Zhan Zhang (Harbin Institute of Technology)*; Yanjun Shu (Harbin Institute of Technology); Xin Liu (Harbin Institute of Technology); Mingxuan He (Harbin Institute of Technology) -
DecoPa: Query Decomposition for Parallel Complex Event Processing
Samira Akili (HU Berlin )*; Steven Purtzel (Humboldt-Universität zu Berlin); Matthias Weidlich (Humboldt-Universität zu Berlin) -
Convolution and Cross-Correlation of Count Sketches Enables Fast Cardinality Estimation of Multi-Join Queries
Mike Heddes (University of California, Irvine)*; Igor Nunes (University of California, Irvine); Tony Givargis (University of California, Irvine); Alex Nicolau (UC Irvine)
Session 12: Benchmarking + Indexing
Tuesday June 11 5:00 pm – 6:30 pm
Location: Aconcagua
Session Chair: Milos Nikolic
-
Rethink of a Learned Cost Model: Why Start from Scratch?
JIANI YANG (Zhejiang University); Sai Wu (Zhejiang University)*; Dongxiang Zhang (Zhejiang University); Jian Dai (Alibaba Group); Feifei Li (Alibaba Group); Gang Chen (Zhejiang University) -
Sibyl: Forecasting Time-Evolving Query Workloads
Hanxian Huang (UC San Diego)*; Tarique Siddiqui (Microsoft Research); Rana Alotaibi (Microsoft Gray Systems Lab); Carlo Curino (Microsoft -- GSL); Jyoti Leeka (Microsoft); Alekh Jindal (SmartApps); Jishen Zhao (UCSD); Jesús Camacho-Rodríguez (Microsoft); Yuanyuan Tian (Microsoft Gray Systems Lab) -
LST-Meter: Benchmarking Log-Structured Tables in the Cloud
Jesús Camacho-Rodríguez (Microsoft)*; Ashvin Agrawal (Microsoft - GSL); Anja Gruenheid (Microsoft); Ashit Gosalia (Microsoft); Cristian Petculescu (Microsoft); Josep Aguilar Saborit (Microsoft); Avrilia Floratou (Microsoft); Carlo Curino (Microsoft -- GSL); Raghu Ramakrishnan (Microsoft) -
System-X: Towards Fast Dependency Graph Generation for Database Replay
Wonseok Lee (POSTECH); Jaehyun Ha (POSTECH); Wook-Shin Han (POSTECH)*; Changgyoo Park (SAP Labs Korea); Myunggon Park (SAP Labs Korea); Juhyeng Han (SAP); Juchang Lee (SAP) -
Revisiting B-tree Compression: An Experimental Study
Chuqing Gao (Purdue University); Shreya Ballijepalli (Purdue University); Jianguo Wang (Purdue University)* -
PLATON: Top-down R-tree Packing with Learned Partition Policy
Jingyi Yang (Nanyang Technological University)*; Gao Cong (Nanyang Technological Univesity)
Session 13:Data Systems for AI and ML (2)
Wednesday June 12 1:00 pm – 2:30 pm
Location: Tupungato
Session Chair: Avigdor Gal
-
BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach
Zhen Zheng (Alibaba Group)*; Zaifeng Pan (Renmin University of China); Dalin Wang (Renmin University of China); Kai Zhu (Alibaba Inc.); Wenyi Zhao (Alibaba Group.); tianyou guo (Alibaba Inc); Xiafei Qiu (Alibaba Group); minmin sun (Alibaba); Junjie Bai (Alibaba Group); Feng Zhang (Renmin University of China); Xiaoyong Du (Renmin University of China); Jidong Zhai (Tsinghua University); Wei Lin (Alibaba Group) -
Nexus: Correlation Discovery over Collections of Spatio-Temporal Tabular Data
Yue Gong (The University of Chicago)*; Sainyam Galhotra (Cornell University); Raul Castro Fernandez (The University of Chicago) -
DGC: Training Dynamic Graphs with Spatio-Temporal Non-Uniformity using Graph Partitioning by Chunks
Fahao Chen (The University of Aizu); Peng Li (the University of Aizu)*; Celimuge Wu (The University of Electro-Communications) -
HongTu: Scalable Full-Graph GNN Training on Multiple GPUs
Qiange Wang (National University of Singapore)*; Yao Chen (National University of Singapore); Weng-Fai Wong (National University of Singapore); Bingsheng He (National University of Singapore) -
Efficient Algorithm for K-Multiple-Means
Yasuhiro Fujiwara (NTT Communication Science Laboratories)*; Atsutoshi Kumagai (NTT Computer and Data Science Laboratories); Yasutoshi Ida (NTT); Masahiro Nakano (NTT communication science laboratories); Makoto Nakatsuji (NTT); Akisato Kimura (NTT Corporation) -
FeatureLTE: Learning to Estimate Feature Importance
Tianping Zhang (Tsinghua University); Zhaoyang Wang (Ant Group); Chen Qian (Ant Group); Jian Li (" Tsinghua University, China"); Yin Lou (Ant Group)*
Session 14: Storage (2)
Wednesday June 12 1:00 pm – 2:30 pm
Location: Llaima
Session Chair: Umar Farooq Minhas
-
Wii: Dynamic Budget Reallocation In Index Tuning
Xiaoying Wang (Simon Fraser University); Wentao Wu (Microsoft Research)*; Chi Wang (Microsoft Research); Vivek Narasayya (Microsoft); Surajit Chaudhuri (Microsoft) -
MirrorKV: An Efficient Key-Value Store on Hybrid Cloud Storage with Balanced Performance of Compaction and Querying
Zhiqi Wang (The Chinese University of Hong Kong)*; Zili Shao (The Chinese University of Hong Kong) -
Limousine: Blending Learned and Classical Indexes to Self-Design Larger-than-Memory Cloud Storage Engines
Subarna Chatterjee (Harvard University, USA)*; Mark F Pekala (Harvard); Lev Kruglyak (Harvard University); Stratos Idreos (Harvard) -
PreVision: An Out-of-Core Matrix Computation System with Optimal Buffer Replacement
Kyoseung Koo (Seoul National University); Sohyun Kim (Seoul National University); Wonhyeon Kim (Seoul National University); Yoojin Choi (Seoul National University); Juhee Han (Seoul National Unviersity); Bogyeong Kim (Seoul National University); Bongki Moon (Seoul National University)* -
Hyper: A High-Performance and Memory-Efficient Learned Index via Hybrid Construction
Shunkang Zhang (HKUST); Ji Qi (Institute of Software, Chinese Academy of Sciences); Xin YAO (Huawei Theory Lab); André Brinkmann (Johannes Gutenberg University Mainz)* -
Wred: Workload Reduction for Scalable Index Tuning
Matteo Brucato (Microsoft Research)*; Tarique Siddiqui (Microsoft Research); Wentao Wu (Microsoft Research); Vivek Narasayya (Microsoft); Surajit Chaudhuri (Microsoft)
Session 15: Data Security (2)
Wednesday June 12 1:00 pm – 2:30 pm
Location: Puyehue/Calbuco
Session Chair: Li Xiong
-
DProvDB: Differentially Private Query Processing with Multi-Analyst Provenance
Shufan Zhang (University of Waterloo)*; Xi He (University of Waterloo) -
DP-starJ: A Differential Private Scheme towards Analytical Star-Join Queries
Congcong Fu (Xidian University); Hui Li (Xidian University)*; Jian Lou (Zhejiang University); Huizhen Li (Xidian University); Jiangtao Cui (Xidian University) -
Anchor: A Library for Building Secure Persistent Memory Systems
Dimitrios Stavrakakis (TU Munich & University of Edinburgh)*; Dimitra Giantsidi (University of Edinburgh); Maurice Bailleu (The University of Edinburgh); Philip Sändig (Technical University of Munich); Shady Issa (TUM); Pramod Bhatotia (TU Munich) -
VEIL: A Storage and Communication Efficient Volume-Hiding Algorithm
Shanshan Han (UCI)*; Vishal Chakraborty (University of California, Irvine); Michael T Goodrich (Univ. of California, Irvine); Sharad Mehrotra (U.C. Irvine); Shantanu Sharma (New Jersey Institute of Technology) -
An LDP Compatible Sketch for Securely Approximating Set Intersection Cardinalities
Pinghui Wang (Xi'an Jiaotong University)*; Yitong Liu (Xi’an Jiaotong University); Zhicheng Li (Xi'an Jiaotong University); Rundong Li (Xi'an Jiaotong University) -
Local Differentially Private Heavy Hitter Detection in Data Streams with Bounded Memory
Xiaochen Li (Zhejiang university)*; Weiran Liu (Alibaba Group); Jian Lou (Zhejiang University); Yuan Hong (University of Connecticut); Lei Zhang (Alibaba Group); Zhan Qin (Zhejiang University); Kui Ren (Zhejiang University)
Session 16: Semistructured and Uncertain Data
Wednesday June 12 1:00 pm – 2:30 pm
Location: Aconcagua
Session Chair: Arijit Khan
-
ChainedFilter: Combining Membership Filters by Chain Rule
Haoyu Li (The University of Texas at Austin)*; Liuhui Wang (Peking University); Qizhi Chen (Peking University); Jianan Ji (Peking University); Yuhan Wu (Peking University); Yikai Zhao (Peking University); Tong Yang (Peking University); Aditya Akella (UT Austin) -
Reservoir Sampling over Joins
Binyang DAI (Hong Kong University of Science and Technology)*; Xiao Hu (University of Waterloo); Ke Yi (Hong Kong Univ. of Science and Technology) -
StarfishDB: a query execution engine for relational probabilistic programming
Ouael Ben Amara (University of Michigan - Dearborn); sami hadouaj (university of michigan dearborn); Niccolo Meneghetti (University of Michigan - Dearborn)* -
High-Ratio Compression for Machine-Generated Data
Jiujing Zhang (Guangzhou University); Zhitao Shen (Ant Group); Shiyu Yang (Guangzhou University)*; Lingkai Meng (Shanghai Jiao Tong University); Chuan Xiao (Osaka University, Nagoya University); wei jia (antgroup); Yue Li (Ant Group); Qinhui Sun (Ant Group); Wenjie Zhang (University of New South Wales); Xuemin Lin (Shanghai Jiaotong University) -
AS-Parser: Log Parsing Based on Adaptive Segmentation
Chen XiaoLei (Fudan University); Peng Wang (" Fudan University, China")*; Jia Chen (Fudan University); Wei Wang (" Fudan University, China") -
RITA: Group Attention is All You Need for Timeseries Analytics
Jiaming Liang (University of Pennslyvania); Lei Cao (University of Arizona/MIT)*; Samuel Madden (MIT); Zack Ives (University of Pennsylvania); Guoliang Li (Tsinghua University)
Session 17: Graphs (3)
Wednesday June 12 3:00 pm – 4:30 pm
Location: Parinacota
Session Chair: Senjuti Basu Roy
-
MCR-Tree: An Efficient Index for Multi-dimensional Core Search
Chengyang Luo (Zhejiang University); Yifan Zhu (Zhejiang University); Qing Liu (Zhejiang University); Yunjun Gao (Zhejiang University)*; Lu Chen (Zhejiang University); Jianliang Xu (Hong Kong Baptist University) -
Efficient and Provable Effective Resistance Computation on Large Graphs: an Index-based Approach
Meihao Liao (Beijing Institute of Technology); Zhou Junjie (Beijing Institude of technology); Ronghua Li (Beijing Institute of Technology)*; Qiangqiang Dai (Beijing Institute of Technology); Hongyang Chen (Zhejiang Lab); Guoren Wang (Beijing Institute of Technology) -
Graph Summarization: Compactness Meets Efficiency
Deming Chu (University of New South Wales); Fan Zhang (Guangzhou University)*; Wenjie Zhang (University of New South Wales); Ying Zhang (University of Technology Sydney); Xuemin Lin (University of New South Wales) -
A Counting-based Approach for Efficient 𝑘-Clique Densest Subgraph Discovery
Yingli Zhou (The Chinese University of Hong Kong, Shenzhen)*; Qingshuo Guo (The Chinese University of Hong Kong, Shenzhen); Yixiang Fang (The Chinese University of Hong Kong, Shenzhen); Chenhao Ma (The Chinese University of Hong Kong, Shenzhen) -
Implementation Strategies for Views over Property Graphs
Soonbo Han (University of Pennsylvania)*; Zack Ives (University of Pennsylvania) -
CAVE: Concurrency-Aware Graph Processing on SSDs
Tarikul Islam Papon (Boston University)*; Taishan Chen (Boston University); Shuo Zhang (Columbia University); Manos Athanassoulis (Boston University)
Session 18: Query Processing (3)
Wednesday June 12 3:00 pm – 4:30 pm
Location: Tupungato
Session Chair: Dan Suciu
-
Query Compilation Without Regrets
Philipp M Grulich (Technische Universität Berlin)*; Aljoscha P Lepping (TU Berlin); Dwi P. A. Nugroho (Technische Universität Berlin); Varun Pandey (TU Berlin); Bonaventura Del Monte (Observe Inc.); Steffen Zeuch (TU Berlin); Volker Markl (Technische Universität Berlin) -
Cabin: a Compressed Adaptive Binned Scan Index
Yiyuan Chen (University of Chinese Academy of Sciences); Shimin Chen (Chinese Academy of Sciences)* -
Efficient Approximation of Kemeny's Constant for Large Graphs
Haisong Xia (Fudan University)*; Zhongzhi Zhang (Fudan University) -
Worst-Case-Optimal Similarity Joins on Graph Databases
Diego Arroyuelo (UTFSM, Chile); Benjamin Bustos (Department of Computer Science, University of Chile); Adrián Gómez-Brandón (Universidade da Coruña, Spain); Aidan Hogan (Universidad de Chile, Chile); Gonzalo Navarro (University of Chile); Juan Reutter (PUC)* -
Hierarchical Cut Labelling – Scaling Up Distance Queries on Road Networks
Muhammad Farhan (Australian National University)*; Henning Koehler (Massey University); Robert Ohms (Australian National University); Qing Wang (ANU) -
MWP: Multi-Window Parallel Evaluation of Regular Path Queries on Streaming Graphs
Siyuan Zhang (Fudan University)*; Zhenying He (Fudan University); Yinan Jing (Fudan University); Kai Zhang (Fudan University); X. Sean Wang (Fudan University)
Session 19: AI and ML in Databases (2)
Wednesday June 12 3:00 pm – 4:30 pm
Location: Llaima
Session Chair: Ibrahim Sabek
-
Lorentz: Learned SKU Recommendation Using Profile Data
Nicholas K Glaze (Microsoft); Irwin H McNeely (Microsoft); Yiwen Zhu (Microsoft)*; Matthew Gleeson (Microsoft); Helen Serr (Microsoft ); Rajeev S Bhopi (MICROSOFT ); Subru Krishnan (Microsoft) -
SchemaPile: A Large Collection of Relational Database Schemas
Till Döhmen (University of Amsterdam)*; Radu Geacu (University of Amsterdam); Madelon Hulsebos (UC Berkeley); Sebastian Schelter (University of Amsterdam) -
Solo: Data Discovery using Natural Language Questions via a Self-Supervised Approach
Qiming Wang (The University of Chicago)*; Raul Castro Fernandez (The University of Chicago) -
Controllable Tabular Data Synthesis Using Diffusion Models
Tongyu Liu (Renmin University of China); Ju Fan (Renmin University of China)*; Nan Tang (HKUST (GZ)); Guoliang Li (Tsinghua University); Xiaoyong Du (Renmin University of China) -
One seed, two birds: A unified learned structure for exact and approximate counting.
Yingze Li (Harbin Institude of Technology); Hongzhi Wang (Harbin Institute of Technology)*; Xianglong Liu (Harbin Institute of Technology) -
Making In-Memory Learned Indexes Efficient on Disk
Jiaoyi Zhang (Tsinghua University)*; Kai Su (Tsinghua University); Huanchen Zhang (Tsinghua University)
Session 20: Spatiotemporal data
Wednesday June 12 3:00 pm – 4:30 pm
Location: Aconcagua
Session Chair: Jan Van den Bussche
-
Origin-Destination Travel Time Oracle for Map-based Services
Yan Lin (Beijing Jiaotong University); Huaiyu Wan (Beijing Jiaotong University); Jilin Hu (Aalborg University)*; Shengnan Guo (Beijing Jiaotong University); Bin Yang (Aalborg University); Youfang Lin (Beijing Jiaotong University); Christian S. Jensen (Aalborg University) -
Demystifying the QoS and QoE of Edge-hosted Video Streaming Applications in the Wild with SNESet
Yanan Li (State Key Laboratory of Networking and Switching Technology)*; Guangqing Deng (Alibaba Cloud); Changming Bai (Alibaba Cloud); Jingyu Yang (Alibaba Cloud); Gang Wang (Alibaba Cloud); Hao Zhang (Alibaba Cloud); Jin Bai (Alibaba Cloud); Haitao Yuan (Nanyang Technological University); Mengwei Xu (State Key Laboratory of Networking and Switching Technology); Shangguang Wang (State Key Laboratory of Networking and Switching Technology) -
Temporal JSON Keyword Search
Curtis Dyreson (Utah State University)*; Amani Shatnawi (Yarmouk University ); Sourav S Bhowmick (Nanyang Technological University); Vishal Sharma (University of Nevada, Las Vegas) -
Proximity Queries on Point Clouds using Rapid Construction Path Oracle
Yinzhao YAN (Hong Kong University of Science and Technology)*; Raymond Chi-Wing Wong (Hong Kong University of Science and Technology) -
FineMon: An Innovative Adaptive Network Telemetry Scheme for Fine-Grained, Multi-Metric Data Monitoring with Dynamic Frequency Adjustment and Enhanced Data Recovery
Haojie Ji (Hunan university)*; Kun Xie (hunan university); Jigang Wen (Hunan University of Science and Technology); Qingyi Zhang (Huawei Technologies); Gaogang Xie (Institute of Computing Technology, Chinese Academy of Sciences); Wei Liang (Hunan University of science and technology) -
Optimizing Time Series Queries with Versions
Rui Kang (Tsinghua University); Shaoxu Song (Tsinghua University)*
Session 21: Graphs (4)
Wednesday June 12 5:00 pm – 6:30 pm
Location: Parinacota
Session Chair: Angela Bonifati
-
Parallel Algorithms for Hierarchical Nucleus Decomposition
Jessica Shi (MIT); Laxman Dhulipala (University of Maryland, College Park); Julian Shun (MIT)* -
View-based Explanations for Graph Neural Networks
Tingyang Chen (Zhejiang University); Dazhuo Qiu (Aalborg University); Yinghui Wu (Case Western Reserve University); Arijit Khan (Aalborg University); Xiangyu Ke (Zhejiang University, China)*; Yunjun Gao (Zhejiang University) -
GE^2: A General and Efficient Graph Embedding Learning System
Chenguang Zheng (CUHK)*; Guanxian Jiang (CUHK); Xiao Yan (Centre for Perceptual and Interactive Intelligence (CPII) ); Peiqi Yin (The Chinese University of Hong Kong); qihui zhou (CUHK); James Cheng (CUHK) -
TeraHAC: Hierarchical Agglomerative Clustering of Trillion-Edge Graphs
Laxman Dhulipala (UMD and Google Research)*; Jason Lee (Google); Jakub Łącki (Google); Vahab Mirrokni (Google) -
Neural Attributed Community Search at Billion Scale
Jianwei Wang (University of New South Wales)*; Kai Wang (Shanghai Jiao Tong University); Xuemin Lin (University of New South Wales); Wenjie Zhang (University of New South Wales); Ying Zhang (University of Technology Sydney) -
Enriching Recommendation Models with Logic Conditions
Lihang Fan (Beihang University); Wenfei Fan (Univ. of Edinburgh ); Ping Lu (Beihang Univ.); Chao Tian (Beihang University)*; Qiang Yin (Shanghai Jiao Tong University)
Session 22: Query Processing (4)
Wednesday June 12 5:00 pm – 6:30 pm
Location: Tupungato
Session Chair: Kyuseok Shim
-
GEqO: ML-Accelerated Semantic Equivalence Detection
Rana Alotaibi (Microsoft Gray Systems Lab); Brandon Haynes (Microsoft Gray Systems Lab)*; Yuanyuan Tian (Microsoft Gray Systems Lab); Anna Pavlenko (Microsoft Gray Systems Lab); Jyoti Leeka (Microsoft); Alekh Jindal (SmartApps) -
Lemo: A Cache-Enhanced Learned Optimizer for Concurrent Queries
Songsong Mo (Nanyang Technological University)*; Yile Chen (Nanyang Technological University); Hao Wang (Nanyang Technological University); Gao Cong (Nanyang Technological Univesity); Zhifeng Bao (RMIT University) -
ASM: Harmonizing Autoregressive model, Sampling, and Multi-dimensional Statistics Merging for Cardinality Estimation
Kyoungmin Kim (POSTECH); Sangoh Lee (POSTECH); Injung Kim (Handong Global University); Wook-Shin Han (POSTECH)* -
LPLM: A Neural Language Model for Cardinality Estimation of LIKE-Queries
Mehmet Aytimur (University of Konstanz)*; Silvan Reiner (University of Konstanz); Leonard Wörteler (Universität Konstanz); Theodoros Chondrogiannis (University of Konstanz); Michael Grossniklaus (University of Konstanz) -
A Learned Cuckoo Filter for Approximate Membership Queries over Variable-sized Sliding Windows on Data Streams
Yao Tian (The Hong Kong University of Science and Technology)*; yan tingyun (GuangZhou University); Ruiyuan Zhang (The Hong Kong university of Science and Technology); Kai Huang ( Macau University of Science and Technology); Bolong Zheng (Huazhong University of Science and Technology); Xiaofang Zhou (The Hong Kong University of Science and Technology) -
NOCAP: Near-Optimal Correlation-Aware Partitioning Joins
Zichen Zhu (Boston University)*; Xiao Hu (University of Waterloo); Manos Athanassoulis (Boston University)
Session 23: Storage (3)
Wednesday June 12 5:00 pm – 6:30 pm
Location: Llaima
Session Chair: Prashant Pandey
-
MOST: Model-Based Compression with Outlier Storage for Time Series Data
Zehai Yang (Institute of Computing Technology, CAS & University of Chinese Academy of Sciences); Shimin Chen (Chinese Academy of Sciences)* -
Spruce: a Fast yet Space-saving Structure for Dynamic Graph Storage
Jifan Shi (University of Science and Technology of China)*; Biao Wang (University of Science and Technology of China); Yun Xu (University of Science and Technology of China) -
Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads
Dingheng Mo (Nanyang Technological University); Fanchao Chen (Fudan University); Siqiang Luo (Nanyang Technological University)*; Caihua Shan (microsoft) -
SALI: A Scalable Adaptive Learned Index Framework based on Probability Models
Jiake Ge (Renmin University of China)*; Huanchen Zhang (Tsinghua University); Boyu Shi (RENMIN UNIVERSITY of CHINA); Yuanhui Luo (Renmin University of China); Yunda Guo (Renmin University of China); yunpeng chai (renmin university of china); Yuxing Chen (Tencent); Anqun Pan (Tencent Inc., China) -
SWIX: A Memory-efficient Sliding Window Learned Index
Liang Liang (Imperial College London)*; Guang Yang (Imperial College London); Ali Hadian (Imperial College London); Luis Alberto Croquevielle (Imperial College London); Thomas Heinis (Imperial College) -
GRF: A Global Range Filter for LSM-Trees with Shape Encoding
Hengrui Wang (Tsinghua University)*; Te Guo (Purdue University); Junzhao Yang (Tsinghua University); Huanchen Zhang (Tsinghua University)
Session 24: Data Systems for AI and ML (3)
Wednesday June 12 5:00 pm – 6:30 pm
Location: Puyehue/Calbuco
Session Chair: Kexin Rong
-
Data Acquisition for Improving Model Confidence
Yifan Li (York University)*; Xiaohui Yu (York University); Nick Koudas (University of Toronto) -
Range-Filtering Approximate Nearest Neighbor Search
Chaoji Zuo (Rutgers University - New Brunswick); Miao Qiao (The University of Auckland); Wenchao Zhou (Alibaba Group); Feifei Li (Alibaba Group); Dong Deng (Rutgers University - New Brunswick)* -
ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data
Liana Patel (Stanford University)*; Matei Zaharia (Berkeley and Databricks); Peter Kraft (DBOS, Inc.); Carlos Guestrin (Stanford University) -
Generation of Training Examples for Tabular Natural Language Inference
Jean-Flavien Bussotti (Eurecom); Enzo Veltri (Università della Basilicata); Donatello Santoro (Università della Basilicata); Paolo Papotti (EURECOM)*
Session 25: Graphs (5) + Data Exploration (2)
Thursday June 13 1:00 pm – 2:30 pm
Location: Tupungato
Session Chair: Ioana Manolescu
-
On Querying Historical Connectivity in Temporal Graphs
Jingyi Song (University of New South Wales); Dong Wen (University of New South Wales)*; Lantian Xu (University of Technology Sydney); Lu Qin (UTS); Wenjie Zhang (University of New South Wales); Xuemin Lin (Shanghai Jiaotong University) -
uBlade: Efficient Batch Processing for Uncertainty Graph Queries
Siyuan Yao (SMU); Yuchen Li (Singapore Management University)*; Shixuan Sun (Shanghai Jiao Tong University); Jiaxin Jiang (National University of Singapore); Bingsheng He (National University of Singapore) -
Materialized View Selection & View-Based Query Planning for Regular Path Queries
Yue Pang (Peking University)*; Lei Zou (Peking University); Jeffrey Xu Yu (Chinese University of Hong Kong); Linglin Yang (Peking University) -
TabEE: Tabular Embeddings Explanations
Roni Copul (Tel Aviv University); Nave Frost (eBay); Tova Milo (Tel Aviv University); Kathy Razmadze (Tel Aviv University)* -
Auto-Formula: Recommend Formulas in Spreadsheets using Learned Table Representations
Sibei Chen (Renmin University of China); Yeye He (Microsoft Research)*; Weiwei Cui (Microsoft Research Asia); Ju Fan (Renmin University of China); Song Ge (Microsoft Reseach Asia); Haidong Zhang (Microsoft Research Asia); Dongmei Zhang (Microsoft Research Asia); Surajit Chaudhuri (Microsoft) -
Qr-Hint: Actionable Hints Towards Correcting Wrong SQL Queries
Yihao Hu (Duke University)*; Amir Gilad (The Hebrew University); Kristin Stephens-Martinez (Duke University); Sudeepa Roy (Duke University, USA); Jun Yang (Duke University)
Session 26: Data Warehousing + Distributed Databases (1)
Thursday June 13 1:00 pm – 2:30 pm
Location: Puyehue/Calbuco
Session Chair: Aparna Varde
-
SH2O: Efficient Data Access for Work-sharing Databases
Panagiotis Sioulas (EPFL)*; Ioannis Mytilinis (EPFL); Anastasia Ailamaki (EPFL) -
Lightweight Materialization for Fast Dashboards Over Joins
Zezhou Huang (Columbia University)*; Eugene Wu (Columbia University) -
Rethinking the Encoding of Integers for Scans on Skewed Data
Martin Prammer (University of Wisconsin - Madison)*; Jignesh Patel (Carnegie Mellon University) -
Determining Exact Quantiles with Randomized Summaries
Ziling Chen (Tsinghua University); Haoquan Guan (Tsinghua University); Shaoxu Song (Tsinghua University)*; Xiangdong Huang (Tsinghua University); Chen Wang (Timecho Limited); Jianmin Wang ("Tsinghua University, China") -
Scalable Distributed Inverted List Indexes in Disaggregated Memory
Manuel Widmoser (University of Salzburg)*; Daniel Kocher (University of Salzburg); Nikolaus Augsten (University of Salzburg) -
Fault Tolerance Placement for the Internet of Things
Anastasiia Kozar (TU Berlin)*; Bonaventura Del Monte (Observe Inc.); Steffen Zeuch (TU Berlin); Volker Markl (Technische Universität Berlin)
Session 27: Storage (4)
Thursday June 13 1:00 pm – 2:30 pm
Location: Europa
Session Chair: Zsolt István
-
LeCo: Lightweight Compression via Learning Serial Correlations
Yihao Liu (Tsinghua University)*; Xinyu Zeng (Tsinghua University); Huanchen Zhang (Tsinghua University) -
Grafite: Taming Adversarial Queries with Optimal Range Filters
Marco Costa (University of Pisa); Paolo Ferragina (Università di Pisa); Giorgio Vinciguerra (Università di Pisa)* -
ChainKV: A Semantics-Aware Key-Value Store for Ethereum System
Zehao Chen (Shandong University)*; Bingzhe Li (University of Texas at Dallas); Xiaojun Cai (Shandong University); Zhiping Jia (Shandong University); Lei Ju (Shandong University); Zili Shao (The Chinese University of Hong Kong); Zhaoyan Shen (Shandong University) -
LIT: Lightning-fast In-memory Temporal Indexing
George Christodoulou (Delft University of Technology); Panagiotis Bouros (Johannes Gutenberg University Mainz)*; Nikos Mamoulis (University of Ioannina) -
Practical Dynamic Extension for Sampling Indexes
Douglas B Rumbaugh (Penn State University)*; Dong Xie (Penn State University)
Session 28: Cloud Management
Thursday June 13 1:00 pm – 2:30 pm
Location: Antartica
Session Chair: Justin Levandoski
-
VeriTxn: Verifiable Transactions for Cloud-Native Databases with Storage Disaggregation
Zhanhao Zhao (Renmin University of China); Hexiang Pan (National University of Singapore); Gang Chen (Zhejiang University); Xiaoyong Du (Renmin University of China); WEI LU (Renmin University of China); Beng Chin Ooi (NUS)* -
Cackle: Analytical Workload Cost and Performance Stability With Elastic Pools
Matthew J Perron (MIT CSAIL)*; Michael Cafarella (MIT CSAIL); Raul Castro Fernandez (The University of Chicago); David DeWitt (MIT); Samuel Madden (MIT) -
High-performance Effective Scientific Error-bounded Lossy Compression with Auto-tuned Multi-component Interpolation
Jinyang Liu (University of California, Riverside); Sheng Di (Argonne National Laboratory, Lemont, IL)*; Kai Zhao (Florida State University); Xin Liang (University of Kentucky); Sian Jin (Indiana University); Zizhe Jian (University of California Riverside); Jiajun Huang (UCR); Shixun Wu (University of California Riverside); zizhong chen (UC Riverside); Franck Cappello (Argonne National Laboratory, Lemont, IL) -
SkyPIE: A Fast & Accurate Oracle for Object Placement
Tiemo Bang (UC Berkeley)*; Chris Douglas (UC Berkeley ); Natacha Crooks (UC Berkeley); Joseph M Hellerstein (UC Berkeley) -
Vexless: A Serverless Vector Data Management System Using Cloud Functions
Yongye Su (Purdue University); Yinqi Sun (Purdue University); Minjia Zhang (Microsoft AI and Research); Jianguo Wang (Purdue University)* -
Understanding the Performance Implications of the Design Principles in Storage-Disaggregated Databases
Xi Pang (Purdue University); Jianguo Wang (Purdue University)*
Session 29: Query Processing (5) + Emerging/Embedded
Thursday June 13 3:00 pm – 4:30 pm
Location: Parinacota
Session Chair: Florin Rusu
-
Rethink Query Optimization in HTAP Databases
Haoze Song (The University of Hong Kong)*; Wenchao Zhou (Alibaba Group); Feifei Li (Alibaba Group); Xiang Peng (Alibaba); Heming Cui (The University of Hong Kong) -
Correlation Joins over Time Series Data Streams Utilizing Complementary Dimension Reduction and Transformation
AmirReza Alizade Nikoo (University of Zürich)*; Sven Helmer (University of Zurich); Michael H Böhlen (University of Zurich) -
In-depth Analysis of Continuous Subgraph Matching in a Common Delta Query Compilation Framework
Yukyoung Lee (POSTECH); Kyoungmin Kim (POSTECH); Wonseok Lee (POSTECH); Wook-Shin Han (POSTECH)* -
gSWORD: GPU-accelerated Sampling for Subgraph Counting
chang ye (SMU)*; Yuchen Li (Singapore Management University); Shixuan Sun (Shanghai Jiao Tong University); Wentian Guo (Meta Inc.) -
Zero-sided RDMA: Network-driven Data Shuffling for Disaggregated Heterogeneous Cloud DBMSs
Matthias Jasny (TU Darmstadt)*; Lasse Thostrup (TU Darmstadt); Sajjad Tamimi (TU Darmstadt); Andreas Koch (TU Darmstadt); Zsolt István (TU Darmstadt); Carsten Binnig (TU Darmstadt) -
PimPam: Efficient Graph Pattern Matching on Real Processing-in-Memory Hardware
Shuangyu Cai (Tsinghua University); Boyu Tian (Tsinghua University); Huanchen Zhang (Tsinghua University); Mingyu Gao (Tsinghua University)*
Session 30: Responsible Data Management (1) + Multimedia + NLP
Thursday June 13 3:00 pm – 4:30 pm
Location: Tupungato
Session Chair: Brit Youngmann
-
F3KM: Federated, Fair, and Fast k-means
Shengkun Zhu (Wuhan University)*; Quanqing Xu (OceanBase, Ant Group ); Jinshan ZENG (Jiangxi Normal University); Sheng Wang (Wuhan University); Yuan Sun (La Trobe University); Zhifeng Yang (OceanBase); Chuanhui Yang (OceanBase); Zhiyong Peng (" Wuhan University, China") -
Faster Algorithms for Fair Max-Min Diversification in R^d
Yash Kurkure (University of Illinois Chicago); Miles Shamo (University of Illinois at Chicago); Joseph Wiseman (UIC); Sainyam Galhotra (Cornell University); Stavros Sintos (University of Illinois Chicago)* -
SeeSaw: Interactive Ad-hoc Search Over Image Databases
Oscar Moll (MIT CSAIL)*; Manuel A Favela (Massachusetts Institute of Technology); Vijay Gadepally (MIT Lincoln Laboratory); Michael Cafarella (MIT CSAIL); Samuel Madden (MIT) -
Predictive and Near-Optimal Sampling for View Materialization in Video Databases
Yanchao Xu (Zhejiang University); Dongxiang Zhang (Zhejiang University)*; Shuhao Zhang (Nanyang Technological University); Sai Wu (Zhejiang University); Zexu Feng (Zhejiang University); Gang Chen (Zhejiang University) -
RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search
Jianyang Gao (Nanyang Technological University); Cheng Long (Nanyang Technological University)* -
CodeS: Towards Building Open-source Language Models for Text-to-SQL
Haoyang Li (Renmin University of China)*; Jing Zhang (Renmin University of China); Hanbing Liu (Renmin University of China); Ju Fan (Renmin University of China); Xiaokang Zhang (Renmin University of China); jun zhu (BEIJING AI-FINANCE TECHNOLOGIES CO. LTD); Renjie Wei (BEIJING AI-FINANCE TECHNOLOGIES CO. LTD); Hongyan Pan (BEIJING AI-FINANCE TECHNOLOGIES CO. LTD); Cuiping Li (Renmin University of China); Hong Chen (" Renmin University, China")
Session 31: Distributed Databases (2) + Transaction Processing
Thursday June 13 3:00 pm – 4:30 pm
Location: Puyehue/Calbuco
Session Chair: Vasiliki Kalavri
-
NOC-NOC: Towards Performance-optimal Distributed Transactions
Si Liu (ETH Zurich)*; Luca Multazzu (ETH Zurich); Hengfeng Wei (Nanjing University); David A Basin (ETH Zurich) -
Efficient Distributed Hop-Constrained Path Enumeration on Large-Scale Graphs
yuanyuan zeng (Chinese University of Hong Kong, Shenzhen)*; Yixiang Fang (The Chinese University of Hong Kong, Shenzhen); Chenhao Ma (The Chinese University of Hong Kong, Shenzhen); Xu Zhou (Hunan university); Kenli Li (Hunan University) -
Historical Embedding-Guided Efficient Large-Scale Federated Graph Learning
Anran Li (Nanyang Technological University)*; Yuanyuan Chen (Nanyang Technological University); Mingfei Cheng (Singapore Management University); Yihao Huang (Nanyang Technological University); Jian Zhang (Nanyang Technological University); Yueming Wu (Nanyang Technological University); Anh Tuan Luu (Nanyang Technological University); Han Yu (Nanyang Technological University (NTU)) -
Play like a Vertex: A Stackelberg Game Approach for Streaming Graph Partitioning
Zezhong Ding (University of Science and Technology of China)*; Yongan Xiang (University of Science and Technology of China ); Shangyou Wang (University of Science and Technology of China); Xike Xie (University of Science and Technology of China); S. Kevin Zhou (USTC) -
Optimizing Distributed Protocols with Query Rewrites
David CY Chu (UC Berkeley)*; Rithvik Panchapakesan (UC Berkeley); Shadaj Laddad (UC Berkeley); Lucky E Katahanas (Sutter Hill Ventures); Chris Liu (University of California, Berkeley); Kaushik Shivakumar (University of California); Natacha Crooks (UC Berkeley); Joseph M Hellerstein (UC Berkeley); Heidi Howard (Microsoft) -
ADGNN: Towards Scalable GNN Training with Aggregation-Difference Aware Sampling
Zhen Song (Northeastern University)*; Yu Gu (Northeastern University); Tianyi Li (Aalborg University); Qing Sun (Northeastern University); Yanfeng Zhang (Northeastern University); Christian S. Jensen (Aalborg University); Ge Yu (Northeastern University)
Session 32: Data Integration + Provenance (1)
Thursday June 13 3:00 pm – 4:30 pm
Location: Antartica
Session Chair: Wolfgang Gatterbauer
-
Determining the Largest Overlap between Tables
Luca Zecchini (Università degli Studi di Modena e Reggio Emilia)*; Tobias Bleifuß (Hasso Plattner Institute); Giovanni Simonini (University of Modena and Reggio Emilia); Sonia Bergamaschi (Università di Modena e Reggio Emilia); Felix Naumann (Hasso Plattner Institute, University of Potsdam) -
High Precision ≠ High Cost: Temporal Data Fusion for Multiple Low-Precision Sensors
jingyu zhu (Nankai university); Yu Sun (Nankai University)*; Shaoxu Song (Tsinghua University); Xiaojie Yuan (Nankai Univeristy) -
Homomorphic Compression: Making Text Processing on Compression Unlimited
JiaWei Guan (Renmin University of China)*; Feng Zhang (Renmin University of China); Siqi Ma (University of New South Wales); kuangyu chen (renmin univers.); Yihua Hu (Renmin University of China); Yuxing Chen (Tencent); Anqun Pan (Tencent Inc., China); Xiaoyong Du (Renmin University of China) -
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Peng Li (Georgia Institute of Technology); Yeye He (Microsoft Research)*; Dror Y (Microsoft); Weiwei Cui (Microsoft Research Asia); Song Ge (Microsoft Reseach Asia); Haidong Zhang (Microsoft Research Asia); Danielle Rifinski Fainman (Microsoft); Dongmei Zhang (Microsoft Research Asia); Surajit Chaudhuri (Microsoft) -
Udon: Efficient Debugging of User-Defined Functions in Big Data Systems with Line-by-Line Control
Yicong Huang (UC Irvine)*; Zuozhi Wang (U C IRVINE); Chen Li (UC Irvine) -
Banzhaf Values for Facts in Query Answering
Omer Abramovich (Tel Aviv University); Daniel Deutch (Tel Aviv University)*; Nave Frost (eBay); Ahmet Kara (University of Zurich); Dan Olteanu (University of Zurich)
Session 33: AI and ML in DB (3)
Thursday June 13 5:00 pm – 6:30 pm
Location: Parinacota
Session Chair: Nesime Tatbul
-
Modeling Shifting Workloads for Learned Database Systems
Peizhi Wu (University of Pennsylvania)*; Zack Ives (University of Pennsylvania) -
Cardinality Estimation over Knowledge Graphs with Embeddings and Graph Neural Networks
Tim Schwabe (Ruhr University Bochum)*; Maribel Acosta (Technische Universität München) -
Approximate Sketches
Brian Tsan (UC Merced)*; Asoke Datta (University of California, Merced); Yesdaulet Izenov (University of California, Merced); Florin Rusu (UC Merced) -
PreLog: A Pre-trained Model for Log Analytics
Van-Hoang Le (The University of Newcastle)*; Hongyu Zhang (University of Newcastle) -
Can Learned Indexes be Build Efficient? A Deep Dive into Sampling Trade-Offs
Minguk Choi (Dankook University); Seehwan Yoo (Dankook University); Jongmoo Choi (Dankook University)* -
ThalamusDB: Approximate Query Processing on Multi-Modal Data
Saehan Jo (Cornell University)*; Immanuel Trummer (Cornell University)
Session 34: Responsible Data Management (2)
Thursday June 13 5:00 pm – 6:30 pm
Location: Tupungato
Session Chair: Divesh Srivastava
-
Query Refinement for Diverse Top-k Selection
Felix S Campbell (Ben-Gurion University of the Negev)*; Alon Silberstein (Ben Gurion University); Yuval Moskovitch (Ben Gurion University); Julia Stoyanovich (New York University) -
Equitable Top-k Results for Long Tail Data
Md Mouinul Islam (New Jersey Institute of Technology ); Mahsa Asadi (New Jersey Institute of Technology); Senjuti Basu Roy (NJIT)* -
FairHash: A Fair and Memory/Time-efficient Hashmap
Nima Shahbazi (University of Illinois at Chicago)*; Stavros Sintos (University of Illinois Chicago); Abolfazl Asudeh (University of Illinois Chicago) -
Fast Shapley Value Computation in Data Assemblage Tasks as Cooperative Simple Games
Xuan Luo (Simon Fraser University)*; Jian Pei (Simon Fraser University); Cheng Xu (Hong Kong Baptist University); Wenjie Zhang (University of New South Wales); Jianliang Xu (Hong Kong Baptist University) -
Relative Keys: Putting Feature Explanation into Context
Shuai An (University of Edinburgh); Yang Cao (University of Edinburgh)* -
Counterfactual Explanation at Will, with Zero Privacy Leakage
Shuai An (University of Edinburgh); Yang Cao (University of Edinburgh)*
Session 35: Security (3)
Thursday June 13 5:00 pm – 6:30 pm
Location: Puyehue/Calbuco
Session Chair: Mohammad Javad Amiri
-
Privacy Amplification by Sampling under User-level Differential Privacy
Juanru FANG (HKUST); Ke Yi (Hong Kong Univ. of Science and Technology)* -
Keep It Simple: Testing Databases via Differential Query Plans
Jinsheng Ba (National University of Singapore)*; Manuel Rigger (National University of Singapore) -
Continual Observation of Joins under Differential Privacy
Wei Dong (CMU)*; Zijun CHEN (HKUST); Qiyao Luo (Hong Kong University of Science and Technology); Elaine Shi (CMU); Ke Yi (Hong Kong Univ. of Science and Technology) -
Object-oriented Unified Encrypted Memory Management for Heterogeneous Memory Architectures
Mo Sha (Alibaba Group)*; Yifan Cai (University of Pennsylvania); Sheng Wang (Alibaba Group); Linh Thi Xuan Phan (University of Pennsylvania); Feifei Li (Alibaba Group); Kian-Lee Tan (National University of Singapore) -
Secure Sampling for Approximate Multi-party Query Processing
Qiyao Luo (Hong Kong University of Science and Technology); Yilei Wang (Alibaba Cloud); Ke Yi (Hong Kong Univ. of Science and Technology)*; Sheng Wang (Alibaba Group); Feifei Li (Alibaba Group)
Session 36: Data Integration + Provenance (2)
Thursday June 13 5:00 pm – 6:30 pm
Location: Antartica
Session Chair: Steven Whang
-
The Battleship Approach to the Low Resource Entity Matching Problem
Bar Genossar (Technion -- Israel Institute of Technology)*; Avigdor Gal (Technion -- Israel Institute of Technology); Roee Shraga (Northeastern University) -
Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation
Zhengjie Miao (Simon Fraser University); Jin Wang (Megagon Labs)* -
Unstructured Data Fusion for Schema and Data Extraction
kaiwen chen (university of Toronto)*; Nick Koudas (University of Toronto) -
R2D2: Reducing Redundancy and Duplication in Data Lakes
Raunak Shah (Adobe Research); Koyel Mukherjee (Adobe Research)*; Atharv Tyagi (Adobe); Dhruv Joshi (Indian Institute of Technology Kharagpur); Subrata Mitra (Adobe Research); Shivam Pravin Bhosale (Indian Institute of Technology Kharagpur); Sai Keerthana Karnam (IIT Kharagpur) -
DTT: An Example-Driven Tabular Transformer for Joinability by Leveraging Large Language Models
Arash Dargahi Nobari (University of Alberta)*; Davood Rafiei (University of Alberta) -
Discovering Functional Dependencies through Hitting Set Enumeration
Tobias Bleifuß (Hasso Plattner Institute)*; Thorsten Papenbrock (Philipps University of Marburg); Thomas Bläsius (Karlsruhe Institute of Technology); Martin Schirneck (Hasso Plattner Institute); Felix Naumann (Hasso Plattner Institute, University of Potsdam)
INDUSTRY SESSIONS
Session 1: Query Engines
Tuesday June 11 5:00 pm – 6:30 pm
Location: Europa
-
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine
Andrew Lamb (InfluxData); Yijie Shen (Space and Time); Daniël Heres (Coralogix); Jayjeet Chakraborty (UC Santa Cruz); Mehmet Ozan Kabak (Synnada); Liang-Chi Hsieh (Apple); Chao Sun (Apple) -
Unified Query Optimization in the Fabric Data Warehouse
Nicolas Bruno (Microsoft); Cesar A Galindo-Legaria (Microsoft); Milind Joshi (Microsoft); Esteban Calvo (Microsoft); Kabita Mahapatra (Microsoft); Sharon Ravindran (Microsoft); Guoheng Chen (Microsoft); Ernesto Cervantes Juarez (Microsoft); Beysim Sezgin (Microsoft) -
SQL with Measures
Julian Hyde (Google)*; John Fremlin (Google) -
ByteCard: Enhancing ByteDance's Data Warehouse with Learned Cardinality Estimation
Yuxing Han (ByteDance); WangHaoYu WangHaoYu (ByteDance); Lixiang Chen (Bytedance); Yifeng Dong (ByteDance); Xing Chen (ByteDance); Benquan Yu (Bytedance); Chengcheng Yang (East China Normal University); Weining Qian (East China Normal University) -
Automated Multidimensional Data Layouts in Amazon Redshift
Jialin Ding (Amazon Web Services); Matt Abrams (AWS); Sanghita Bandyopadhyay (AWS); Luciano Di Palma (AWS); Yanzhu Ji (AWS); Davide Pagano (AWS); Gopal Paliwal (AWS); Panos Parchas (AWS); Pascal Pfeil (Amazon Web Services ); Orestis Polychroniou (Amazon); Gaurav Saxena (Amazon); Aamer Shah (AWS); Amina Voloder (AWS); Sherry Xiao (AWS); Davis Zhang (AWS); Tim Kraska (AWS) -
Automated Clustering Recommendation With Database Zone Maps
Suratna Budalakoti (Oracle Corporation); Mohamed Ziauddin (Oracle USA); Andrew Witkowski (Oracle Corporation); You Jung Kim (Oracle Corporation); Ramarajan Krishnamachari (Oracle Corporation); Alan Wood (Oracle Labs)
Session 2: LLMs and ML Applications
Wednesday June 12 1:00 pm – 2:30 pm
Location: Europa
-
Similarity Joins of Sparse Features
Ahmed H Metwally (Uber); Michael Shum (MIT) -
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis
Chao Zhang (Zhejiang University); Yuren Mao (Zhejiang University); Yijiang Fan (Zhejiang University); Yu Mi (Zhejiang University); Yunjun Gao (Zhejiang University); Lu Chen (Zhejiang University); Dongfang Lou (Hundsun Technologies INC.); Jinshu Lin (Hundsun Research Institute) -
Rock: Cleaning Data by Embedding ML in Logic Rules
xianchun bao (sics); Zian Bao (SICS); bie binbin (Shenzhen Institute of Computing Science); QingSong Duan (Shenzhen Institute of Computing Science); Wenfei Fan (Univ. of Edinburgh ); hui lei (SICS); Daji Li (Shenzhen Institute of Computing Science); Wei Lin (Shenzhen Institute of Computing Science); peng liu (Shenzhen Institute of Computing Science); Lv Zhicong (Shenzhen Institute of Computing Science); Mingliang Ouyang (Shenzhen Institute of Computing Science); tang shuai (Shenzhen Institute of Computing Science); Yaoshu Wang (Shenzhen Institute of Computing Sciences, Shenzhen University); Qiyuan Wei (Shenzhen Institute of Computing Sciences); Min Xie (Shenzhen Institute of Computing Sciences ); Jing Zhang (Shenzhen Institute of Computing Science); zhang xin (Shenzhen Institute of Computing Science); zhao runxiao (Shenzhen Institute of Computing Science); zhou shuping (Shenzhen Institute of Computing Science) -
Data-Juicer: A One-Stop Data Processing System for Large Language Models
Daoyuan Chen (Alibaba Group); Yilun Huang (Alibaba Group); Ma Zhijian (Alibaba Group); Hesen Chen (Alibaba Group); Xuchen Pan (Alibaba Group); Ce Ge (Alibaba Group); Dawei Gao (Alibaba-inc); Yuexiang Xie (Alibaba Group); Zhaoyang Liu (Alibaba Group); Jinyang Gao (Alibaba Group); Yaliang Li (Alibaba Group); Bolin Ding ("Data Analytics and Intelligence Lab, Alibaba Group"); Jingren Zhou (Alibaba Group) -
The Hopsworks Feature Store for Machine Learning
Javier de la Rúa Martínez (Hopsworks AB and KTH Royal Institute of Technology); Fabio Buso (Hopsworks AB); Antonios Kouzoupis (Hopsworks AB); Alexandru A. Ormenisan (Hopsworks AB); Davit Bzhalava (Hopsworks AB); Salman Niazi (Hopsworks AB); Kenneth Mak (Hopsworks AB); Victor Jouffrey (Hopsworks AB); Mikael Ronström (Hopsworks AB); Ralfs Zangis (Hopsworks AB); Dhananjay Mukhedkar (Hopsworks AB); Raymond Cunningham (Hopsworks AB); Ayushman Khazanchi (KTH Royal Institute of Technology); Vladimir Vlassov (KTH Royal Institute of Technology, Stockholm, Sweden); Jim Dowling (KTH Royal Institute of Technology and Hopsworks AB) -
COSMO: A Large-Scale E-commerce Common Sense Knowledge Generation and Serving System at Amazon
Changlong Yu (HKUST); Xin Liu (Hong Kong University of Science and Technology); Jefferson Maia (Amazon); Tianyu Cao (Amazon); Yang Li (Amazon); Yifan Gao (The Chinese University of Hong Kong); Yangqiu Song (Hong Kong University of Science and Technology); Rahul Goutam (Amazon); Haiyang Zhang (Amazon); Bing Yin (Amazon); Zheng Li (Amazon)
Session 3: Cloud Storage
Wednesday June 12 3:00 pm – 4:30 pm
Location: Europa
-
LETUS: A Log-Structured Efficient Trusted Universal BlockChain Storage
Shikun Tian (Ant Group); Zhonghao Lu (Ant Group); Haizhen Zhuo (Ant Group); XiaoJing Tang (Ant Group); Peiyi Hong (Ant Group); shenglong chen (Ant Group); Dayi Yang (Ant Group); Ying Yan (Ant Group); Zhiyong Jiang (Ant group); Hui Zhang (Ant Group); Guofei Jiang (Ant Group) -
Vortex: A Stream-oriented Storage Engine For Big Data Analytics
Pavan Edara (Google); Jonathan Forbesj (Google); BIGANG LI (Google GCP) -
Native Cloud Object Storage in Db2 Warehouse: Implementing a Fast and Cost-Efficient Cloud Storage Architecture
David Kalmuk (IBM Analytics); Christian Garcia-Arellano (IBM Canada); Ronald Barber (IBM Research); Richard Sidle (IBM Research); Kostas Rakopoulos (IBM Canada); Hamdi Roumani (IBM Canada); William Minor (IBM Canada); Alexander Cheung (IBM Canada); Robert C. Hooper (IBM Canada); Matthew Emmerton (IBM Canada); Zach Hoggard (IBM Canada); Scott Walkty (IBM Canada); Patrick Perez (IBM Canada); Aleksandrs Santars (IBM Canada); Michael Chen (IBM Canada); Matthew Olan (IBM Canada); Daniel Zilio (IBM Canada); Imran Sayyid (IBM Canada); Humphrey Li (IBM Canada); Ketan Rampurkar (IBM Canada); Krishna K. Ramachandran (IBM); Yiren Shen (IBM Canada) -
ESTELLE: An Efficient and Cost-effective Cloud Log Engine
Yupu Zhang (University of Electronic Science and Technology of China); GuangLin Cong (Cloud Database Innovation Lab of Cloud BU, Huawei Technologies Co., Ltd.); Jihan Qu (University of Electronic Science and Technology of China); Xu Ran (Huawei); Yuan Fu (University of Electronic Science and Technology of China); Weiqi Li (Cloud Database Innovation Lab of Cloud BU, Huawei Technologies Co., Ltd.); Feiran Hu (Cloud Database Innovation Lab of Cloud BU, Huawei Technologies Co., Ltd.); Jing Liu (Cloud Database Innovation Lab of Cloud BU, Huawei Technologies Co., Ltd.); Wenliang Zhang (Cloud Database Innovation Lab of Cloud BU, Huawei Technologies Co., Ltd.); Kai Zheng (University of Electronic Science and Technology of China) -
TimeCloth: Fast Point-in-Time Database Recovery in The Cloud
Jianjun Deng (Alibaba Group); Jianan Lu (Princeton University); Hua Fan (Alibaba Group); Chaoyang Liu (Alibaba Group Hangzhou); Shi Cheng (Alibaba Cloud); Cuiyun Fu (Alibaba Group); Wenchao Zhou (Alibaba Group)
Session 4: Cloud Databases
Thursday June 13 1:00 pm – 2:30 pm
Location: Llaima
-
Proactive Resume and Pause of Resources for Microsoft Azure SQL Database Serverless
Olga Poppe (Microsoft); Pankaj Arora (Microsoft); Sakshi Sharma (Microsoft); Jie Chen (Microsoft); Willis Lang (Microsoft); Qun Guo (Microsoft); Sachin Pandit (Microsoft); Vaishali Jhalani (Microsoft); Rahul Sawhney (Microsoft); Anupriya Inumella (Microsoft); Sanjana Dulipeta Sridhar (Microsoft); Dheren Gala (Microsoft); Nilesh Rathi (Microsoft); Morgan Oslake (Microsoft); Alexandru Chirica (Microsoft); Sarika Iyer (Microsoft); Prateek Goel (Microsoft); Ajay Kalhan (Microsoft) -
Vertically Autoscaling Monolithic Applications with CaaSPER: Scalable Container-as-a-Service Performance Enhanced Resizing Algorithm for the Cloud
Anna Pavlenko (Microsoft Gray Systems Lab); Joyce Cahoon (Microsoft); Yiwen Zhu (Microsoft); Brian Kroth (Microsoft); Michael Nelson (Microsoft); Andrew Carter (Microsoft); David Liao (Microsoft); Travis Wright (Microsoft); Jesús Camacho-Rodríguez (Microsoft); Karla Saur (Microsoft)* -
Flux: Decoupled Auto-Scaling for Heterogeneous Query Workload in Alibaba AnalyticDB
Wei Li (Alibaba Group); Jiachi Zhang (Alibaba Group); Ye Yin (AlibabaCloud); Yan Li (Alibaba Group); Zhanyang Zhu (Alibaba Group); Wenchao Zhou (Alibaba Group); Liang Lin (Alibaba); Feifei Li (Alibaba Group) -
Intelligent Scaling in Amazon Redshift
Vikram Nathan (Amazon, Inc.); Vikramank Y Singh (Amazon); Zhengchun Liu (Amazon); Mohammad Rahman (Amazon, Inc.); Andreas Kipf (UTN); Dominik Horn (Amazon, Inc.); Davide Pagano (Amazon, Inc.); Gaurav Saxena (Amazon); Balakrishnan Narayanaswamy (Amazon); Tim Kraska (AWS) -
Stage: Query Execution Time Prediction in Amazon Redshift
Ziniu Wu (Massachusetts Institute of Technology); Ryan Marcus (University of Pennsylvania); Zhengchun Liu (Amazon); Parimarjan Negi (MIT CSAIL); Vikram Nathan (MIT); Pascal Pfeil (Amazon Web Services ); Gaurav Saxena (Amazon); Mohammad Rahman (Amazon Web Services); Balakrishnan Narayanaswamy (Amazon); Tim Kraska (MIT)
Session 5: Cloud Database Architecture
Thursday June 13 3:00 pm – 4:30 pm
Location: Llaima
-
PolarDB-MP: A Multi-Primary Cloud-Native Database via Disaggregated Shared Memory
xinjun Yang (Alibaba Group); Yingqiang Zhang (Alibaba Group); Hao Chen (Alibaba Group ); Feifei Li (Alibaba Group); Bo Wang (Alibaba Group); Jing Fang (Alibaba Group); Chuan Sun (Alibaba Group); Yuhui Wang (Alibaba Group) -
Amazon MemoryDB: A Fast and Durable Memory-First Cloud Database
Yacine Taleb (AWS); Kevin Mcgehee (AWS); Nan Yan (AWS); Shawn Wang (AWS); Stefan Mueller (AWS); Allen Samuels (AWS) -
Extending Polaris to Support Transactions
Josep Aguilar Saborit (Microsoft); Alan Halverson (Microsoft); Raghu Ramakrishnan (Microsoft); Kevin Bocksrocker (Microsoft) -
BigLake: BigQuery's Evolution toward a Multi-Cloud Lakehous
Justin Levandoski (Google); Garrett Casto (Google); Mingge Deng (Google); Rushabh Desai (Google); Pavan Edara (Google); Thibaud Hottelier (Google); Amir Hormati (Google); Anoop Johnson (Google); Jeff Johnson (Google); Dawid Kurzyniec (Google); Sam McVeety (Google); Prem Ramanathan (Google); Gaurav Saxena (Google); Vidya Shanmugan (Google); Yuri Volobuev (Google) -
Predicate Caching: Query-Driven Secondary Indexing for Cloud Data Warehouses
Tobias Schmidt (TUM); Andreas Kipf (UTN); Dominik Horn (Amazon Web Services); Gaurav Saxena (Amazon); Tim Kraska (AWS)
Session 6: Graph Data Management
Thursday June 13 5:00 pm – 6:30 pm
Location: Llaima
-
BG3: A Cost Effective and I/O Efficient Graph Database in Bytedance
wei zhang (bytedance); cheng chen (bytedance); Qiange Wang (National University of Singapore); wei wang (bytedance); shijiao yang (bytedance); bingyu zhou (bytedance); huiming zhu (bytedance); Chao Chen (ByteDance); yongjun zhao (bytedance); Yingqian HU (ByteDance); miaomiao cheng (bytedance); Meng LI (ByteDance); Hongfei Tan (ByteDance); Mengjin Liu (ByteDance); hexiang lin (bytedance); Shuai Zhang (Bytedance); Lei Zhang (ByteDance) -
PG-Triggers: Triggers for Property Graphs
Stefano Ceri (Politecnico di Milano); Anna Bernasconi (Politecnico di Milano); Alessia Ms. Gagliardi (Politecnico di Milano); Davide Martinenghi (Politecnico di Milano); Luigi Bellomarini (Banca d'Italia); Davide Magnanimi (Banca d'Italia) -
GraphScope Flex: LEGO-like Graph Computing Stack
Tao He (Alibaba Group); Shuxian Hu (Alibaba Group); Longbin Lai (Alibaba Group); Dongze Li (Alibaba Group); Neng Li (Alibaba); Xue Li (Alibaba Group); Lexiao Liu (Alibaba Group); Luo Xiaojian (Alibaba group); Bingqing Lyu (Alibaba Group); Ke Meng (Alibaba Group); Sijie Shen (Alibaba Group); Li Su (Alibaba Group); Lei Wang (Alibaba Group); Jingbo Xu (Alibaba Group); Wenyuan Yu (Alibaba Group); Weibin Zeng (Alibaba); Lei Zhang (Alibaba); Siyuan Zhang (Alibaba Group); Jingren Zhou (Alibaba Group); XiaoLi Zhou (阿里巴巴); Diwen Zhu (Alibaba) -
Bouncer: Admission Control with Response Time Objectives for Low-latency Online Data Systems
Hao Xu (LinkedIn); Juan Colmenares (LinkedIn) -
NPA: Improving Large-scale Graph Neural Networks with Non-parametric Attention
Wentao Zhang (Peking University); Guochen Yan (Peking University); Yu Shen (Peking University); Ling Yang (Peking University); Yangyu Tao (Tencent); Bin Cui (Peking University); Jian Tang (HEC Montreal; Mila)
DEMO SESSIONS
Group A
Tuesday June 11 1:00 pm – 2:30 pm
Location: Europa
Thursday June 123 5:00 pm – 6:30 pm
Location: Europa
-
Demonstration of Ver: View Discovery in the Wild
Kevin Dharmawan (University of Indonesia); Chirag A Kawediya (University of Chicago); Yue Gong (The University of Chicago); Zaki Indra Yudhistira (University of Indonesia); Zhiru Zhu (University of Chicago); Sainyam Galhotra (Cornell University); Adila Krisnadhi; Raul Castro Fernandez (The University of Chicago) -
Comquest: Large Scale User Comment Crawling and Integration
Zhijia Chen (Temple University); Lihong He (IBM Almaden Research Center); Eduard Dragut (Temple Univ.); Arjun Mukherjee (University of Houston) -
QueryShield: Cryptographically Secure Analytics in the Cloud
Ethan Seow (Boston University); Yan Tong (University of California, Santa Cruz); Eli M Baum (Boston University); Samuel M Buxbaum (Boston University); Muhammad Faisal (Boston University); John Liagouris (Boston University); Vasiliki Kalavri (Boston University); Mayank Varia (Boston University) -
SIERRA: A Counterfactual Thinking-based Visual Interface for Property Graph Query Construction
Jiebing Ma (Nanyang Technological University); Sourav S Bhowmick (Nanyang Technological University); Lester Tay (Nanyang Technological University); Byron Choi (Hong Kong Baptist University) -
Sawmill: From Logs to Causal Diagnosis of Large Systems
Markos Markakis (Massachusetts Institute of Technology); An Bo Chen (Massachusetts Institute of Technology); Brit Youngmann (Technion - Israel institute of technology); Trinity Gao (MIT); Ziyu Zhang (MIT); Rana Shahout (Harvard); Peter Chen (Massachusetts Institute of Technology); Chunwei Liu (MIT); Ibrahim Sabek (University of Southern California); Michael Cafarella (MIT CSAIL) -
Demonstrating REmatch: a novel regex engine for finding all matche
Kyle Bossonney (Oxford University); Vicente Calisto (PUC Chile); Cristian Riveros (PUC Chile); Gustavo Toro (PUC Chile); Nicolás A Van Sint Jan (PUC); Domagoj Vrgoc (Pontificia Universidad Catolica de Chile) -
ASQP-RL Demo: Learning Approximation Sets for Exploratory Queries
Susan B Davidson (University of Pennsylvania); Tova Milo (Tel Aviv University); Kathy Razmadze (Tel Aviv University); Gal Zeevi (Tel Aviv University) -
IMBridge: Impedance Mismatch Mitigation between Database Engine and Prediction Query Execution
Chenyang Zhang (East China Normal University); Junxiong Peng (East China Normal University); Chen Xu (East China Normal University); Quanqing Xu (OceanBase, Ant Group ); Chuanhui Yang (OceanBase) -
ASM in Action: Fast and Practical Learned Cardinality Estimation
Sangoh Lee (POSTECH); Kyoungmin Kim (EPFL); Wook-Shin Han (POSTECH) -
The Game Of Recourse: Simulating Algorithmic Recourse over Time to Improve Its Reliability and Fairness
Andrew L Bell (New York University); Joao Fonseca (NOVA Information Management School); Julia Stoyanovich (New York University) -
RobOpt: A Tool for Robust Workload Optimization Based on Uncertainty-Aware Machine Learning
Amin Kamali (University of Ottawa); Verena Kantere (National Technical University of Athens); Calisto Zuzarte (IBM); Vincent Corvinelli (IBM) -
Demonstrating CAESURA: Language Models as Multi-Modal Query Planners
Matthias Urban (Technical University of Darmstadt); Carsten Binnig (TU Darmstadt) -
Demonstration of Udon: Line-by-line Debugging of User-Defined Functions in Data Workflows
Yicong Huang (UC Irvine); Zuozhi Wang (U C Irvine); Chen Li (UC Irvine) -
UniTS: A Universal Time Series Analysis Framework Powered by Self-supervised Representation Learning
Zhiyu Liang (Harbin Institute of Technology); Chen Liang (Harbin Institute of technology); Zheng Liang (Harbin Institute of Technology); Hongzhi Wang (Harbin Institute of Technology); Bo Zheng (CnosDB Inc.) -
CHatPipe: Orchestrating Data Preparation Pipelines by Optimizing Human-ChatGPT Interactions
Sibei Chen (Renmin University of China); Hanbing Liu (Renmin University of China); Waiting Jin (Renmin University of China); Xiangyu Sun (Renmin University of China); Xiaoyao Feng (Renmin University of China); Ju Fan (Renmin University of China); Xiaoyong Du (Renmin University of China); Nan Tang (Qatar Computing Research Institute, HBKU)
Group B
Tuesday June 11 3:00 pm – 4:30 pm
Location: Europa
Thursday June 13 3:00 pm – 4:30 am
Location: Europa
-
Responsible Model Selection with Virny and VirnyView
Denys Herasymuk (Ukrainian Catholic University); Falaah Arif Khan (New York University); Julia Stoyanovich (New York University) -
Property Graph Stream Processing In Action with Seraph
Riccardo Tommasini (INSA Lyon - LIRIS); Christopher Rost (University of Leipzig); Angela Bonifati (Univ. of Lyon); Emanuele Della Valle (Politecnico di Milano); Erhard Rahm (University of Leipzig); Keith Hare (JCC Consulting, Inc.); Stefan Plantikow (Neo4j); Petra Selmer (Bloomberg LP); Hannes Voigt (Neo4j) -
MillenniumDB: A multi-modal, multi-model graph database engine
Domagoj Vrgoč (PUC); Carlos Rojas (PUC Chile); Renzo Angles (Universidad de Talca); Marcelo Arenas (Universidad Catolica & IMFD Chile); Vicente Calisto (IMFD); Benjamín F. Farías (Pontificia Universidad Catolica); Sebastián Ferrada (IMFD); Tristan Heuer (IMFD); Aidan Hogan (Universidad de Chile, Chile); Gonzalo Navarro (University of Chile); Alexander Pinto (PUC); Juan Reutter (PUC); Henry Rosales (IMFD); Etienne Toussiant (IMFD) -
IDE: A System for Iterative Mislabel Detection
Yuhao Deng (Beijing Institute of Technology); Deng Qiyan (Beijing Institute of Technology); Chengliang Chai (Beijing Institute of Technology); Lei Cao (University of Arizona/MIT); Nan Tang (HKUST (GZ)); Ju Fan (Renmin University of China); Jiayi Wang (Tsinghua University); Ye Yuan ( Beijing Institute of Technology); Guoren Wang (Beijing Institute of Technology) -
A Demonstration of GPTuner: A GPT-Based Manual-Reading Database Tuning System
Jiale Lao (Sichuan University); Yibo Wang (Sichuan University); Yufei Li (Sichuan University); Jianping Wang (Northwest Normal University); Yunjia Zhang (University of Wisconsin-Madison); Zhiyuan Cheng (Purdue University); Wanghu Chen (Northwest Normal University); Yuanchun Zhou (Computer Network Information Center, Chinese Academy of Sciences); Mingjie Tang (Sichuan University); Jianguo Wang (Purdue University) -
Demonstrating 𝜆-Tune: Exploiting Large Language Models for Workload-Adaptive Database System Tuning
Victor Giannakouris (Cornell University); Immanuel Trummer (Cornell University) -
User-friendly, Interactive, and Configurable Explanations for Graph Neural Networks with Graph Views
Tingyang Chen (Zhejiang University); Dazhuo Qiu (Aalborg University); Yinghui Wu (Case Western Reserve University); Arijit Khan (Aalborg University); Xiangyu Ke (Zhejiang University, China); Yunjun Gao (Zhejiang University) -
OpenIVM: a SQL-to-SQL Compiler for Incremental Computations
Ilaria Battiston (CWI); Peter Boncz (Centrum Wiskunde & Informatica); Kriti Kathuria (UWaterloo) -
Building Reactive Large Language Model Pipelines with Motion
Shreya Shankar (University of California Berkeley); Aditya G Parameswaran (U Illinois) -
Demonstrating Nexus for Correlation Discovery over Collections of Spatio-Temporal Tabular data
Yue Gong (The University of Chicago); Raul Castro Fernandez (The University of Chicago) -
PLUTUS: Understanding Data Distribution Tailoring for Machine Learning
Jiwon Chang (University of Rochester); Christina Dionysio (Technsiche Universität Berlin); Fatemeh Nargesian (University of Rochester); Matthias Boehm (Technische Universität Berlin) -
Multi-Backend Zonal Statistics Execution with Raven
Gereon Dusella (Technische Universität Berlin); Haralampos Gavriilidis (Technische Universität Berlin); Laert Nuhu (Deutsche Kreditbank AG); Volker Markl (Technische Universität Berlin); Eleni Tzirita Zacharatou (IT University of Copenhagen) -
ShiftScope: Adapting Visualization Recommendations to Users' Dynamic Data Focus
Sanad Saha (Oregon State University); Nischal Aryal (Oregon State University); Leilani Battle (University of Washington); Arash Termehchy (Oregon State University) -
Demonstration of ElasticNotebook: Migrating Live Computational Notebook States
Zhaoheng Li (University of Illinois at Urbana-Champaign); Supawit Chockchowwat (University of Illinois at Urbana-Champaign); Hanxi Fang (University of Illinois at Urbana Champaign); Ribhav Sahu (University of Illinois Urbana-Champaign); Sumay Thakurdesai (University of Illinois at Urbana-Champaign); Kantanat Pridaphatrakun (University of Illinois Urbana-Champaign); Yongjoo Park (University of Illinois at Urbana-Champaign)
PANEL DISCUSSIONS
The Future of Graph Analytics
Tuesday June 11 1:00 pm – 2:30 pm
Location: Parinacota
Moderator: Angela Bonifati (University of Lyon)
Panelists:
- Tamer Ozsu (University of Waterloo)
- Yuanyuan Tian (Microsoft Gray Systems Lab)
- Hannes Voigt (Neo4j)
- Wenyuan Yu (Alibaba Group)
- Wenjie Zhang (University of New South Wales)
AI for Systems
Thursday June 13 1:00 pm – 2:30 pm
Location: Parinacota
Organizers: Raghu Ramakrishnan (Microsoft) and Carlo Curino (Microsoft -- GSL)
Moderator: Raghu Ramakrishnan (Microsoft)
Panelists:
- Tim Kraska (MIT & Amazon Web Services)
- Yuanyuan Tian (Microsoft Gray Systems Lab)
- Bolin Ding ("Data Analytics and Intelligence Lab, Alibaba Group")
- Fatma Ozcan (Google)
POSTER SESSIONS
Each of the 3 SIGMOD days has a plenary poster session where all SIGMOD papers presented on that day will get a poster spot. In addition, PODS authors who have expressed interest will also get a poster spot on Tue and Wed.
Session 1
Tuesday June 11 10:30 am – 11:30 am
Location: Foyer Los Volcanes and Las Américas
Session 2
Wednesday June 12 10:30 am – 11:30 am
Location: Foyer Los Volcanes and Las Américas
Session 3
Thursday June 13 10:30 am – 11:30 am
Location: Foyer Los Volcanes and Las Américas
TUTORIAL SESSIONS
Tutorial 1: Demystifying Data Management for Large Language Models
Xupeng Miao (Carnegie Mellon University); Zhihao Jia (Carnegie Mellon University); Bin Cui (Peking University)
Sunday June 9 8:30 am – 10:00 am
Location: Parinacota
Tutorial 2: SmartNICs in the Cloud: The Why, What and How of In-network Processing for Data-Intensive Applications
Faeze Faghih (Technical University of Darmstadt); Tobias Ziegler (Technical University of Darmstadt); Zsolt István (Systems Group, TU Darmstadt); Carsten Binnig (TU Darmstadt & DFKI)
Sunday June 9 8:30 pm – 10:00 am, 10:30 am – 12:30 pm
Location: Llaima
Tutorial 3: Learned Query Optimizer: What is New and What is Next
Rong Zhu (Alibaba Group); Lianggui Weng (Alibaba Group); Bolin Ding (Alibaba Group); Jingren Zhou (Alibaba Group)
Sunday June 9 10:30 am – 12:30 pm
Location: Parinacota
Tutorial 4: Distributed Transaction Processing in Untrusted Environments
Mohammad Javad Amiri (Stony Brook University); Divyakant Agrawal (University of California Santa Barbara); Amr El Abbadi (University of California Santa Barbara); Boon Thau Loo (University of Pennsylvania)
Sunday June 9 2:00 pm – 3:30 pm, 4:00 pm – 6:00 pm
Location: Parinacota
Tutorial 5: Responsible Sharing of Spatiotemporal Data
Raul Castro Fernandez (The University of Chicago); Arnab Nandi (The Ohio State University)
Sunday June 9 2:00 pm – 3:30 pm
Location: Llaima
Tutorial 6: Querying Graph Databases at Scale
Aidan Hogan (DCC, University of Chile); Domagoj Vrgoč (Pontificia Universidad Católica de Chile and IMFD Chile)
Sunday June 9 4:00 pm – 6:00 pm
Location: Llaima
Tutorial 7: Cognitive Psychology Meets Data Management: State of the Art and Future Directions
Sourav S Bhowmick (Nanyang Technological University); S. H. Annabel Chen (Nanyang Technological University); Divesh Srivastava (AT&T Chief Data Office)
Friday June 14 8:30 am – 10:00 am
Location: Parinacota
Tutorial 8: Vector Database Management Techniques and Systems
James Jie Pan (Tsinghua University); Jianguo Wang (Purdue University); Guoliang Li (Tsinghua University)
Friday June 14 8:30 am – 10:00 am
Location: Llaima
Tutorial 9: An Overview of Continuous Querying in (Modern) Data Systems
Riccardo Tommasini (LIRIS - INSA de Lyon); Angela Bonifati (Lyon 1 University, CNRS Liris, IUF)
Friday June 14 10:30 am – 12:30 pm
Location: Parinacota
Tutorial 10: SIMDified Data Processing - Foundations, Abstraction, and Advanced Techniques
Dirk Habich (TU Dresden); Johannes Pietrzyk (TU Dresden)
Friday June 14 10:30 am – 12:30 pm
Location: Llaima
Tutorial 11: Machine Learning for Databases: Foundations, Paradigms, and Open problems
Gao Cong (Nanyang Technological University); Jingyi Yang (Nanyang Technological University); Yue Zhao (Nanyang Technological University)
Friday June 14 2:00 pm – 3:30 pm, 4:00 pm – 6:00 pm
Location: Parinacota
Tutorial 12: Applications and Computation of the Shapley Value in Databases and Machine Learning
Xuan Luo (Simon Fraser University); Jian Pei (Duke University)
Friday June 14 2:00 pm – 3:30 pm
Location: Llaima
Tutorial 13: Beyond Bloom: A Tutorial on Future Feature-Rich Filters
Prashant Pandey (University of Utah); Martín Farach-Colton (New York University); Niv Dayan (University of Toronto); Huanchen Zhang (Tsinghua University)
Friday June 14 4:00 pm – 6:00 pm
Location: Llaima
NEW RESEARCHER SYMPOSIUM
New Researcher Symposium Panel
Wednesday June 12 3:00 pm - 4:30 pm
Location: Puyehue/Calbuco
Moderators: Sainyam Galhotra (Cornell University) and Jia Zou (Arizona State Univ)
Distinguished Panelist:
C Mohan (Hong Kong Baptist University)
Panelists:
- Sourav Bhowmick (Nanyang Technological University)
- Anja Gruenheid (Microsoft)
- Sujaya Maiyya (University of Waterloo)
- Immanuel Trummer (Cornell University)
- Li Xiong (Emory University)
- Ce Zhang (University of Chicago)
New Researcher Symposium Mentoring Session
Thursday June 13 1:00 pm - 2:30 pm
Location: Aconcagua
Organizers: Sainyam Galhotra (Cornell University) and Jia Zou (Arizona State Univ)
Sponsors Talks
AMAZON
Oceanía room
Tuesday 11 June - 13:00 to 14:30 hrs
Data Management innovation at Amazon Web Services
Speaker: Ippokratis Pandis – Vice President/Distinguished Engineer, Amazon Web Services
Aconcagua room
Tuesday 11 June - 13:00 to 14:00 hrs
Data and AI in Databases and Analytics at Google
Speaker: Fatma Ozcan and Justin Levandoski
HUAWEI
Aconcagua room
Tuesday 11 June - 14:00 to 15:00 hrs
Innovation in Cloud-native GaussDB for higher performance, higher availability and higher intelligence.
Speaker: Lei Wang
ALIBABA
Aconcagua room
Tuesday 11 June - 15:00 to 16:00 hrs
GraphScope's Journey with Graph Computing: Progress and Lessons
Speaker: Wenyuan Yu
SNOWFLAKE
Oceanía room
Wednesday 12 June - 14:30 to 15:30
Efficient query processing and searching in Snowflake
Speaker: Alejandro Salinger
BYTE DANCE
Oceanía room
Wednesday 12 June - 15:30 to 16:30
Infrastructure system challenges and research at ByteDance
Speaker: Dr. Jianjun Chen
ORACLE
Oceanía room
Wednesday 12 June - 17:30 to 18:00
Intention is all we need to create data apps
Speaker: Danica Porobic
MICROSOFT
Aconcagua room
Thursday 13 June - 13:00 to 14:00 hrs
Microsoft Fabric: Open Lakes, Not Walled Gardens
Speaker: Raghu Ramakrishnan
DEI
DEI Panel: Global Voices in Data: Navigating Responsible Management and Processing with Diverse Perspectives
Wednesday June 12 5:00 pm – 6:30 pm
Location: Europa
Moderator: Genoveva Vargas-Solar, CNRS, LIRIS
Webpage
Panelists:
- Ricardo Baeza-Yates, Institute for Experiential AI, Northeastern University & DCC, Universidad de Chile
- Jocelyn Dunstan, UC Chile
- Sourav S Bhowmick, Nanyang Technology University
- Jennafer Shae, Roberts, Accel.AI
Birds of a Feather
Wednesday June 12 11:30 am – 12:30 am
Location: Antartica
Moderators: Jesús Camacho Rodríguez (Microsoft), Aidan Hogan (University of Chile)
Interactive session to share statistics, feedback and strategies relating to DEI at SIGMOD/PODS and related events.
PODS Special Event
PODS Special Event: Unlocking the Secrets: Bridging Theory and Practice
Sunday June 9 4:00 pm – 6:00 pm
Location: Aconcagua
Moderator: Floris Geerts (University of Antwerp) and Dan Olteanu (University of Zurich)
Webpage
Panelists:
- Chris Jermaine, Rice university
- Renee J. Miller, Northeastern University
- Hung Ngo, Relational AI