Skip to content

revectores/awesome-mmdb-paper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 

Repository files navigation

Multi-Model Database

Multi-Model Database Survey

  1. UDBMS: Road to Unification for Multi-model Data Management, CoRR'16
  2. Multi-model Data Management: What's New and What's Next?, EDBT'17
  3. The Multi-model Databases – A Review, BDAS'17
  4. Multi-Model Database Management Systems - a Look Forward, Poly/DMAH@VLDB'18
  5. (Tutorial) Multi-model Databases and Tightly Integrated Polystores: Current Practices, Comparisons, and Open Challenges, CIKM'18
  6. Multi-model Databases: A New Journey to Handle the Variety of Data, CSUR'19
  7. Multi-Model Data Modeling and Representation: State of the Art and Research Challenges, IDEAS'21

Multi-Model Query Processing

  1. Query processing in multistore systems: an overview, CIKM'16

Multi-Model Query Language

  1. Multi-SQL: An extensible multi-model data query language, CoRR'20
  2. (Toturial) Multi-Model Data Query Languages and Processing Paradigms, CIKM'20
  3. (Survey) Multi‑model query languages: taming the variety of big data, DPD'23

Category Theory Based Modeling

  1. (Demo) MultiCategory: Multi-model Query Processing Meets Category Theory and Functional Programming, VLDB'21
  2. MM-cat: A Tool for Modeling and Transformation of Multi-Model Data using Category Theory, MODEL'21
  3. Unifying Categorical Representation of Multi-Model Data, SAC'22
  4. A Unified Representation and Transformation of Multi‑Model Data Using Category Theory, BigData'22
  5. A Universal Approach for Simplified Redundancy-Aware Cross-Model Querying, IS'24

Polystore Systems

  1. The Case for Polystores, Stonebraker'15

Myria

  1. Demonstration of the Myria Big Data Management Service, SIGMOD'14
  2. The Myria Big Data Management and Analytics System and Cloud Service, CIDR'17

BigDAWG

  1. The BigDAWG Polystore System, SIGMOD'15
  2. BigDAWG Polystore Query Optimization Through Semantic Equivalences, HPEC'16
  3. The BigDAWG Polystore System and Architecture, CoRR'16
  4. Demonstrating the BigDAWG Polystore System for Ocean Metagenomic Analysis, CIDR'17

TATOOINE

  1. Mixed-instance querying: a lightweight integration architecture for data journalism, VLDB'16

Hybrid

  1. Hybrid.media: High Velocity Video Ingestion in an In-Memory Scalable Analytical Polystore, BIGDATA'17
  2. Hybrid.Poly: A Consolidated Interactive Analytical Polystore System, ICDE'19

Rheem

  1. Rheem: Enabling Multi-Platform Task Execution, SIGMOD'16
  2. RHEEM: enabling cross-platform data processing: may the big data be with you!, VLDB'18
  3. RheemStudio: Cross-Platform Data Analytics Made Easy, ICDE'18
  4. RHEEMix in The Data Jungle: A Cost-based Optimizer for Cross-platform Systems, VLDB'20
  5. Apache Wayang: A Unified Data Analytics Framework, SIGMOD RECORD'23

Estocada

  1. Towards Scalable Hybrid Stores: Constraint-Based Rewriting to the Rescue, SIGMOD'19
  2. ESTOCADA: Towards Scalable Polystore Systems, VLDB'20

AWESOME

  1. Processing Analytical Queries in the AWESOME Polystore [Technical Report], CoRR'21
  2. AWESOME: Empowering Scalable Data Science on Social Media Data with an Optimized Tri-Store Data System, CoRR'21
  3. An Optimized Tri-store System for Multi-model Data Analytics (Technical Report), CoRR'23

Multi-Model Schema Management

Multi-Model Schema Design

  1. Data variety, come as you are in multi-model data warehouses, IS'21
  2. Logical Design of Multi-Model Data Warehouses, KAIS'22
  3. To Each His Own: Accommodating Data Variety by a Multimodel Star Schema, DOLAP@EDBT/ICDT'20

Multi-Model Schema Evolution

  1. Synchronization of Queries and Views Upon Schema Evolutions: A Survey, CSUR'16
  2. MM-evolver: A Multi-Model Evolution Management Tool, EDBT'19
  3. Evolution management in multi-model databases, DKE'21
  4. MM-evocat: A Tool for Modelling and Evolution Management of Multi-Model Data, CIKM'22
  5. Modelling and Evolution Management of Multi-Model Data, SAC'24
  6. A Generic Schema Evolution Approach for NoSQL and Relational Databases, TKDE'24

Multi-Model Schema Inference

  1. MM-infer: A Tool for Inference of Multi-Model Schemas, EDBT'22
  2. A Universal Approach for Multi‑Model Schema Inference, BigData'22

Adaptive Multi-Model Database

  1. (Envision) Self-Adapting Design and Maintenance of Multi-Model Databases, IDEAS'22
  2. Parameters Tuning of Multi-Model Database based on Deep Reinforcement Learning, JIIS'22

Multi-Model Database Benchmarking

  1. TPC-DI: The First Industry Benchmark for Data Integration, VLDB'14
  2. Performance Evaluation of NoSQL Multi-Model Data Stores in Polyglot Persistence Applications, IDEAS'16
  3. Towards Benchmarking Multi-Model Databases, CIDR'17
  4. PolyBench: The First Benchmark for Polystores, TPCTC'18
  5. UniBench: A Benchmark for Multi-Model Database Management Systems, TPCTC'18
  6. Holistic evaluation in multi-model databases benchmarking, DPD'19
  7. How well a multi-model database performs against its single-model variants: Benchmarking OrientDB with Neo4j and MongoDB, FedCSIS'20
  8. (Doctoral thesis) Performance Benchmarking and Query Optimization for Multi-Model Databases, ChaoZhang'21
  9. M2Bench: A Database Benchmark for Multi-Model Analytic Workloads, VLDB'22
  10. A Benchmark for Performance Evaluation of a Multi-Model Database vs. Polyglot Persistence, JDM'23
  11. A Comparative Performance Evaluation of Multi-Model NoSQL Databases and Polyglot Persistence, SAC'23

Multi-Model Evolution Benchmarking

  1. EvoBench – A Framework for Benchmarking Schema Evolution in NoSQL, BigData'20
  2. EvoBench: Benchmarking Schema Evolution in NoSQL, TPCTC'21

Multi-Modal Database

Vector Database

  1. Milvus - A Purpose-Built Vector Data Management System, SIGMOD'21
  2. TASTI: Semantic Indexes for Machine Learning-based Queries over Unstructured Data, SIGMOD'22
  3. A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge, CoRR'23
  4. When Large Language Models Meet Vector Databases: A Survey, CoRR'24
  5. Survey of Vector Database Management Systems, VLDB'24
  6. Rethinking Similarity Search: Embracing Smarter Mechanisms over Smarter Data, CoRR'23
  7. Analyzing Embedding Models for Embedding Vectors in Vector databases, ICTBIG'23

ANNS

Graph-Based Index

  1. Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs, TPAMI'18
  2. A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search, VLDB'21
  3. Revisiting the Index Construction of Proximity Graph-Based Approximate Nearest Neighbor Search, CoRR'24

Tree-Based Index

Quantization-Based Index

  1. Product Quantization for Nearest Neighbor Search, TPAMI'11

Hash-Based Index

  1. DB-LSH 2.0: Locality-Sensitive Hashing With Query-Based Dynamic Bucketing, ICDE'22
  2. Towards Efficient Index Construction and Approximate Nearest Neighbor Search in High-Dimensional Spaces, VLDB'23

Cluster-Based Index

  1. SPFresh: Incremental In-Place Update for Billion-Scale Vector Search, SOSP'23

Constrained ANNS

  1. AnalyticDB-V: a hybrid analytical engine towards query fusion for structured and unstructured data, VLDB'20
  2. PASE: PostgreSQL Ultra-High-Dimensional Approximate Nearest Neighbor Search Extension, SIGMOD'20
  3. HQANN: Efficient and Robust Similarity Search for Hybrid Queries with Structured and Unstructured Constraints, CIKM'22
  4. VBase: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity, OSDI'23
  5. An Efficient and Robust Framework for Approximate Nearest Neighbor Search with Attribute Constraint, NeurIPS'23

ANNS Benchmarking

  1. ANN-Benchmarks: A Benchmarking Tool for Approximate Nearest Neighbor Algorithms, IS'20
  2. Approximate Nearest Neighbor Search on High Dimensional Data - Experiments, Analyses, and Improvement, TKDE'20
  3. Reproducibility protocol for ANN-Benchmarks: A benchmarking tool for approximate nearest neighbor search algorithm
  4. VectorDBBench

Explicit Structured Multi-Modal

  1. PandaDB: Understanding Unstructured Data in Graph Database, CoRR'21
  2. PandaDB: an AI-native graph database for unified managing structured and unstructured data, DASFAA'23
  3. A Model and Query Language for Multi-modal Hybrid Query, SSDBM'24
  4. MMDBench: A Benchmark for Hybrid Query in Multimodal Database, Benchcouncil'24

Visual Database

  1. A Survey on Visual Content-Based Video Indexing and Retrieval, SMC'11
  2. Optasia: A Relational Platform for Efficient Large-Scale Video Analytics, SoCC'16
  3. NoScope: Optimizing Neural Network Queries over Video at Scale, VLDB'17
  4. Scanner: Efficient Video Analysis at Scale, ToG'18
  5. Accelerating Machine Learning Inference with Probabilistic Predicates, SIGMOD'18
  6. LightDB: A DBMS for Virtual Reality Video, VLDB'18
  7. Physical Representation-based Predicate Optimization for a Visual Analytics Database, ICDE'19
  8. VISTA: Optimized System for Declarative Feature Transfer from Deep CNNs at Scale, SIGMOD'20
  9. MIRIS: Fast Object Track Queries in Video, SIGMOD'20
  10. Jointly Optimizing Preprocessing and Inference for DNN-based Visual Analytics, VLDB'21
  11. Optimizing Video Analytics with Declarative Model Relationships, VLDB'22
  12. Optimizing Machine Learning Inference Queries with Correlative Proxy Models, VLDB'22
  13. Optimizing Video Analytics with Declarative Model Relationships, VLDB'22
  14. FiGO: Fine-Grained Query Optimization in Video Analytics, SIGMOD'22
  15. 数据受限条件下的多模态处理技术综述, 中国图象图形学报'22
  16. 支持深度学习的视觉数据库管理系统研究进展, 软件学报'23

General Multi-Modal

  1. Multimodal Neural Databases, SIGIR'23
  2. Databases Unbound: Querying All of the World's Bytes with AI, VLDB'24
  3. A Declarative System For Optimizing AI Workloads, CoRR'24
  4. LOTUS: Enabling Semantic Queries with LLMs Over Tables of Unstructured and Structured Data, CoRR'24
  5. Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs, VLDB'24

Multi-Modal Query Optimization

Machine Learning Tasks Optimization

  1. AIDB: a Sparsely Materialized Database for Queries using Machine Learning, DEEM'24
  2. Hydro: Adaptive Query Processing of ML Queries, CoRR'24

LLM-Based Query Optimization

  1. Towards Multi-Modal DBMSs for Seamless Querying of Texts and Tables, CoRR'23
  2. Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes, CIDR'23
  3. Demonstrating CAESURA: Language Models as Multi-Modal Query Planners, SIGMOD'24
  4. CAESURA: Language Models as Multi-Modal Query Planners, CIDR'24
  5. No More Optimization Rules - LLM-enabled Policy-based Multi-modal Query Optimizer, CoRR'24

Interactive Database

  1. VIVA: An End-to-End System for Interactive Video Analytics, CIDR'22
  2. VOCAL: Video Organization and Interactive Compositional AnaLytics, CIDR'22
  3. SeeSaw: Interactive Ad-hoc Search Over Image Databases, SIGMOD'22
  4. Demonstration of ThalamusDB: Answering Complex SQL Queries with Natural Language Predicates on Multi-Modal Data, SIGMOD'23
  5. ThalamusDB: Approximate Query Processing on Multi-Modal Data, SIGMOD'24

Multi-Modal Machine Learning

  1. The Platonic Representation Hypothesis, CoRR'24

MMML Survey

  1. Multimodal Machine Learning: A Survey and Taxonomy, TPAM'18
  2. Foundations and Trends in Multimodal Machine Learning - Principles, Challenges, and Open Questions, CSUR'24

Multi-Modal Retrieval

  1. UniIR: Training and Benchmarking Universal Multimodal Information Retrievers, CoRR'23
  2. Cross-Modal Retrieval: A Systematic Review of Methods and Future Directions, CoRR'24
  3. MUST: An Effective and Scalable Framework for Multimodal Search of Target Modality, ICDE'24

Multi-Modal RAG

  1. Retrieving Multimodal Information for Augmented Generation: A Survey, ACL'23
  2. Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs, CVPR'24

Vision Question Answering

LLM-Based VQA

  1. An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA, AAAI'22
  2. Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?, ACL'23
  3. KAT: A Knowledge Augmented Transformer for Vision-and-Language, ACL'22
  4. RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training, MM'23

LLM-Based VQA with MMRAG

  1. Retrieval Augmented Visual Question Answering with Outside Knowledge, EMNLP'22
  2. MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and Text, EMNLP'22

About

Paper of Multi-model and multi-modal databases

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published