Publications
2025
- Online Marketplace: A Benchmark for Data Management in Microservices. R. Laigner, Z. Zhang, Y. Liu, L. F. Gomes, Y. Zhou. In SIGMOD’25, International Conference on Management of Data, June 22-27, 2025, Berlin, Germany.
2024
- Rethinking State Management in Actor Systems for Cloud-Native Applications. Y. Liu, R. Laigner, Y. Zhou. In SoCC’24, ACM Symposium on Cloud Computing, Redmond, Nov. 20-22, 2024.
- Benchmarking Data Management Systems for Microservices. R. Laigner, Y. Zhou. In 40th IEEE International Conference on Data Engineering, ICDE 2024, Utrecht, The Netherlands, May 13-16, 2024.
- CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval. C. Lülf, D. M. L. Martins, M. A. V. Salles, Y. Zhou, F. Gieseke. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2024, Washington DC, USA, July 14-18, 2024.
- A Two-Layer Blockchain Sharding Protocol Leveraging Safety and Liveness for Enhanced Performance. Y. Xu, J. Zheng, B. Düdder, T. Slaats, Y. Zhou. In the 31st Annual Network and Distributed System Security Symposium, NDSS 2024, San Diego, California, USA, February 26 - March 1, 2024.
2023
- RapidEarth: A Search Engine for Large-Scale Geospatial Imagery. C. Lülf, D. M. L. Martins, M. A. V. Salles, Y. Zhou, F. Gieseke. In SIGSPATIAL, 2023. (Best Demo Award)
- Fast Search-By-Classification for Large-Scale Databases Using Index-Aware Decision Trees and Random Forests. C. Lülf, D. M. L. Martins, M. A. V. Salles, Y. Zhou, F. Gieseke. In Proceedings of the VLDB Endownment (PVLDB), 2023.
- An exploratory analysis of methods for real-time data deduplication in streaming processes. J. Esteves, R. M. Costa, Y. Zhou, A. Almeida. In DEBS 2023, Proceedings of the 17th ACM International Conference on Distributed and Event-based Systems, DEBS 2023, Neuchatel, Switzerland, June 27-30, 2023. (Best Industry & Applications Paper Award)
2022
- Event-Based Data-Centric Semantics for Consistent Data Management in Microservices. T. Zuckmantel, Y. Zhou, B. Düdder and T. Hildebrandt. In DEBS 2022, 16th ACM International Conference on Distributed and Event‐Based Systems, Copenhagen, Denmark, 2022.
- Hybrid Deterministic and Nondeterministic Execution of Transactions in Actor Systems. Y. Liu, L. Su, V. Shah, Y. Zhou, M. A. V. Salles. In SIGMOD 2022, pp. 65-78. (Presentation, Source Code)
2021
- Data Management in Microservices: State of the Practice, Challenges, and Research Directions. R. Laigner, Y. Zhou, M. A. V. Salles, Y. Liu, M. Klinowski. Proceedings of the VLDB Endowment (PVLDB), Vol 14, Issue 13, pp. 3348-3361, 2021.
- A Distributed Database System for Event-based Microservices. R. Laigner, Y. Zhou, M. A. V. Salles. In DEBS 2021, 15th ACM International Conference on Distributed and Event‐based Systems (DEBS). Milan, Italy, 2021.
- Fast Recovery of Correlated Failures in Distributed Stream Processing Engines. L. Su, Y. Zhou. In DEBS 2021, 15th ACM International Conference on Distributed and Event‐based Systems (DEBS), Milan, Italy, 2021.
- HawkEDA: A Tool for Quantifying Data Integrity Violations in Event-driven Microservices. P. Das, R. Laigner, Y. Zhou. In DEBS 2021, 15th ACM International Conference on Distributed and Event‐based Systems (DEBS), Milan, Italy, 2021.
- Enforcing Consistency in Microservice Architectures through Event-based Constraints. A. Lesniak, R. Laigner, Y. Zhou. In DEBS 2021, 15th ACM International Conference on Distributed and Event‐based Systems (DEBS), Milan, Italy, 2021.
2020
- ByteSeries : An In-Memory Time Series Database for Large-Scale Monitoring Systems. X. Shi, Z. Feng, K. Li, Y. Zhou, H. Jin, Y. Jiang, B. He, Z. Ling, X. Li. in Proceedings of the ACM Symposium on Cloud Computing (SoCC’20), Seattle, WA, USA, October 19-21, 2020.
- From a Monolithic Big Data System to a Microservices Event-Driven Architecture. R. Laigner, M. Kalinowski, P. Diniz, L. Barros, C. Cassino, M. Lemos, D. Arruda, S. Lifschitz, Y. Zhou. In 46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020, Portorož, Slovenia, Aug 26-28, 2020.
- Maxson: Reduce Duplicate Parsing Overhead on Raw Data. X. Shi, Y. Zhang, H. Huang, Z. Hu, H. Jin, H. Shen, Y. Zhou, B. He, R. Li, K. Zhou. In ICDE 2020, IEEE 36th International Conference on Data Engineering, Dallas, Texas, USA, April 20th -24th, 2020.
- Holding a Conference Online and Live due to COVID-19. A. Bonifati, G. Guerrini, C. Lutz, W. Martens, L. Mazilu, N. Paton, M.A.V. Salles, M.H. Scholl, Y. Zhou. ArXiv, abs/2004.07668.
2019
- Location-Centric View Selection in a Location-Based Feed-Following System. K. Chen, Y. Zhou. In DEBS 2019, 13th ACM International Conference On Distributed and Event-Based Systems, Darmstadt, Germany, 24th - 28th June, 2019.
- Deca: a Garbage Collection Optimizer for In-memory Data Processing. X. Shi, Z. Ke, Y. Zhou, L. Lu, X. Zhang, H. Jin, L. He, Z. Hu, F. Wang. In ACM Transactions on Computer Systems (TOCS), Volume 36 Issue 1, pp. 3:1–3:47, March 2019.
- Modeling and Building IoT Data Platforms with Actor-Oriented Databases. Y. Wang, J. C. D. Reis, K. M. Borggren, M. A. V. Salles, C. B. Medeiros and Y. Zhou. In EDBT 2019, 22nd International Conference on Extending Database Technology. March 26-29, 2019.
- Towards Low-Latency Batched Stream Processing by Pre-Scheduling. H. Jin, F. Chen, S. Wu, Y. Yao, Z. Liu, L. Gu, Y. Zhou. In IEEE Transactions on Parallel and Distributed Systems (TPDS), Vo. 30, Issue 3, pp. 710 - 722, March 2019.
- Passive and Partially Active Fault Tolerance for Massively Parallel Stream Processing Engines. L. Su, Y. Zhou. In IEEE Transactions on Knowlege and Data Engineering (TKDE), Vo. 31, No. 1, January 2019. (Special Section on best of ICDE’2016)
2018
- Graph Processing on GPUs: A Survey. X. Shi, Z. Zheng, Y. Zhou, H. Jin, L. He, B. Liu, Q. Hua, ACM Computing Surveys, Volume 50 Issue 6, 2018.
- Query-Centric Failure Recovery for Distributed Stream Processing Engines. L. Su, Y. Zhou. In ICDE 2018, 34th IEEE International Conference on Data Engineering, poster paper, Paris France, April 16th - 19th, 2018.
- Stateful Load Balancing for Parallel Stream Processing. Q. Guo, Y. Zhou. In: Euro-Par 2017: Parallel Processing Workshops. Euro-Par 2017. Lecture Notes in Computer Science, vol 10659. Springer, Cham
2017
- Parallel SPARQL Query Optimization B. Wu, Y. Zhou, H. Jin, A. Deshpande. In ICDE 2017, 33rd IEEE International Conference on Data Engineering, San Diego, California, USA, April 19-22, 2017.
- Integrative Dynamic Reconfiguration in a Parallel Stream Processing Engine. K. G. S. Madsen, Y. Zhou, J. Cao. In ICDE 2017, 33rd IEEE International Conference on Data Engineering, poster paper, San Diego, California, USA, April 19-22, 2017.
- Progressive Recovery of Correlated Failures in Distributed Stream Processing Engines. L. Su, Y. Zhou. In EDBT 2017, 20th International Conference on Extending Database Technology, short paper, Venice, Italy, March 21-24, 2017.
- CBP: A New Parallelization Paradigm for Massively Distributed Stream Processing. Q. Guo, Y. Zhou. In DASFAA 2017, 22nd International Conference on Database Systems for Advanced Applications, Suzhou, China, March 27-30, 2017.
- Lever: Towards Low-Latency Batched Stream Processing by Pre-Scheduling. F. Chen, S. Wu, H. Jin, Y. Yao, Z. Liu, L. Gu, Y. Zhou. In SOCC 2017, ACM Symposium on Cloud Computing, poster paper, Santa Clara, California, September 25-27, 2017.
2016
- Materialized View Selection in Feed Following Systems. K. Chen, Y. Zhou. In Proceedings of 2016 IEEE International Conference on Big Data (IEEE BigData 2016), Washington D.C., USA, Dec. 5-8, 2016.
- Lifetime-Based Memory Management for Distributed Data Processing Systems. L. Lu, X. Shi, Y. Zhou, X. Zhang, H. Jin, C. Pei, L. He, Y. Geng. Proceedings of the VLDB Endowment (PVLDB), Volume 9, Issue 12, 2016. (extended version)
- Enorm: Efficient Window-Based Computation in Large-Scale Distributed Stream Processing Systems. K. G. S. Madsen, Y. Zhou, L. Su. The 10th ACM International Conference on Distributed and Event-Based Systems (DEBS 2016), Irvine, CA, June 20 - June 24, 2016.
- Tolerating Correlated Failures in Massively Parallel Stream Processing Engines. L. Su, Y. Zhou. 32nd IEEE International Conference on Data Engineering (ICDE 2016), Helsinki, Finland, May 16-20, 2016. (selected as best of ICDE’16 to be invited to submit an extended version to TKDE.)
2015
- Scalable SPARQL Querying using Path Partitioning. B. Wu, Y. Zhou, P. Yuan, L. Liu, H. Jin. 31st IEEE International Conference on Data Engineering (ICDE 2015), Seoul, Korea, April 13-17, 2015.
- Adaptive Grid-Based k-median Clustering of Streaming Data with Accuracy Guarantee. J. Cao, Y. Zhou, M. Wu. The 20th International Conference on Database Systems for Advanced Applications (DASFAA 2015). Hanoi, Vietnam, 20-23 April, 2015. (Best Paper Award)
- Dynamic Resource Management in a MapReduce-Style Platform for Fast Data Processing. K. G. S. Madsen, Y. Zhou. Workshop on Cloud Data Management (CloudDM) In Conjunction with the IEEE International Conference on Data Engineering (ICDE 2015), Seoul, Korea, April 13-17, 2015.
- Online Data Partitioning in Distributed Database Systems. K. Chen, Y. Zhou, Y. Cao. 18th International Conference on Extending Database Technology (EDBT 2015), Brussels, Belgium, March 27-27, 2015.
- Dissemination of Anonymized Streaming Data. Y. Zhou, L. Shou, X. Shang, K. Chen. The 9th ACM International Conference on Distributed Event-Based Systems (DEBS 2015). Oslo, Norway, June 29 - July 3, 2015.
- Dynamic Resource Management in a Massively Parallel Stream Processing Engine. K. Madsen, Y. Zhou. 24th ACM International Conference on Information and Knowledge Management (CIKM 2015) , Melbourne, Australia, October 19-23, 2015.
- PROM: Efficient Matching Query Processing on High-dimensional Data. C. Ma, Y. Zhou, L. Shou, G. Chen. Information Sciences, Volume 322, 20 November 2015, Pages 1-19.
- Feedback Based Continuous Skyline Queries Over a Distributed Framework. A. K. L., J. Cao, Y. Zhou. Advances in Databases and Information Systems - 19th East European Conference (ADBIS 2015), Poitiers, France, September 8-11, 2015, Pages 287-301.
- Distributed Sequence Pattern Detection Over Multiple Data Streams. A. K. Leghari, J. Cao, Y. Zhou. Advances in Databases and Information Systems - 19th East European Conference (ADBIS 2015), Poitiers, France, September 8-11, 2015, pages 380-394.
2014
- Scheduling Online Repartitioning in OLTP Systems. K. Chen, Y. Zhou, Y. Cao. Middleware 2014 - ACM/IFIP/USENIX 15th International Middleware Conference, Bordeaux, France, December 8-12, 2014.
- SemStore: A Semantic-Preserving Distributed RDF Triple Store. B. Wu, Y. Zhou, P. Yuan, H. Jin, L. Liu. 23rd ACM International Conference on Information and Knowledge Management (CIKM), Shanghai, China, November 3-7, 2014.
- Efficient Pattern Detection over a Distributed Framework. A. K. Leghari, Y. Zhou, M. Wolf. 8th Business Intelligence for the Real Time Enterprise (BIRTE) in conjunction with VLDB 2014, 16 pages, Hangzhou, China, September 1, 2014.
- A Framework for Structural Refinement of XML Keyword Search. Q. Guo, Y. Zhou. 4th International Workshop on Semantic Search Over the Web in conjunction with VLDB 2014, Hangzhou, China, September 5, 2014.
- Integrating Fault-Tolerance and Elasticity in a Distributed Low-Latency Streaming System. K. G. S. Madsen, P. P. Thyssen, Y. Zhou. Proceedings of the 26th International Conference on Scientific and Statistical Database Management (SSDBM), demo paper, Aalborg, Denmark, June 30 - July 02, 2014.
- Sequence Pattern Matching over Time-Series Data with Temporal Uncertainty. Y. Zhou, C. Ma, Q. Guo, L. Shou, G. Chen. 17th International Conference on Extending Database Technology (EDBT), Athens, Greece, March 24-28, 2014.
- Efficient Skyline Computation in MapReduce. K. Mullesgaard, J. L. Pedersen, H. Lu and Y. Zhou. 17th International Conference on Extending Database Technology (EDBT), Athens, Greece, March 24-28, 2014.
2013
- Multi-Query Scheduling for Time-Critical Data Stream Applications. Y. Zhou, J. Wu, A. K. Leghari. 25th International Conference on Scientific and Statistical Database Management (SSDBM), Baltimore, Maryland, July 29-31, 2013.
- Multi-Scale Dissemination of Time Series Data. Q. Guo, Y. Zhou, L. Su. 25th International Conference on Scientific and Statistical Database Management (SSDBM), Baltimore, Maryland, July 29-31, 2013.
- Grand Challenge: MapReduce-style Processing of Fast Sensor Data K.G.S. Madsen, L. Su, Y. Zhou. Proceedings of the 7th ACM international conference on Distributed event-based systems (DEBS) , Arlington, Texas, USA, June 29 - July 3, 2013.
- Demo: ELASTIC Mapreduce-style Processing of Fast Data. K.G.S. Madsen, Y. Zhou. Proceedings of the 7th ACM international conference on Distributed event-based systems (DEBS) , Arlington, Texas, USA, June 29 - July 3, 2013.
- Efficient and scalable continuous skyline monitoring in two-tier streaming settings., H. Lu, Y. Zhou, J. Haustad. Information Systems, Elsevier Science Publishing Inc, Volume 38, Issue 1, Pages 68-81, 2013.
2012
- On Optimizing Relational Self-Joins. Y. Cao, Y. Zhou, C. Y. Chan, K.-L. Tan. 15th International Conference on Extending Database Technology (EDBT), Berlin, Germany, March 26-30, 2012.
- Energy Efficiency for MapReduce Workloads: An In-depth Study. B. Feng, J. Lu, Y. Zhou, N. Yang. The 23rd Australasian Database Conference (ADC), 2012. (Runner-up for the best paper award).
2011
- Matching Query Processing in High-Dimensional Space. C. Ma, Y. Zhou, L. Shou, D. Dai, G. Chen. 20th ACM Conference on Information and Knowledge Management (CIKM), short paper, 2011.
- Dissemination of Models over Time-Varying Data. Y. Zhou, Z. Vagena, J. Haustad. Proceedings of the VLDB Endowment (PVLDB), Volume 4, 2011.
2010
- Continuous Skyline Monitoring over Distributed Data Streams. H. Lu, Y. Zhou, J. Haustad. 22nd International Conference on Scientific and Statistical Database Management (SSDBM’10), Heidelberg, Germany, June 30 - July 2, 2010.
- Attribute Outlier Detection over Data Streams. H. Cao, Y. Zhou, L. Shou, G. Chen.15th International Conference on Database Systems for Advanced Applications (DASFAA’10), Tsukuba, Japan, April 1-4, 2010.
2009
- Cluster-Based Rank Query over Multidimensional Data Streams. D. He, Y. Zhou, L. Shou, G. Chen.The 18th ACM Conference on Information and Knowledge Management (CIKM 2009), short paper, Hong Kong, November 2-6, 2009
- Scalable Delivery of Stream Query Results. Y. Zhou, A. Salehi, K. Aberer. Proceedings of the VLDB Endowment (PVLDB), Volume 2, 2009.
- Data-Driven Memory Management for Stream Join. J. Wu, K.-L. Tan, Y. Zhou. Information Systems (Selected Papers from SSDBM’2007), Elsevier Science Publishing Inc, Volume 34 , Issue 4-5, Pages 454-467, June 2009.
- Towards Integrated and Efficient Scientific Sensor Data Processing: A Database Approach. J. Wu, Y. Zhou, K. Aberer, K.-L. Tan. 12th International Conference on Extending Database Technology (EDBT 2009), March 23-26 2009, Saint-Petersburg, Russia.
- Environmental Monitoring 2.0. S. Michel, A. Salehi, L. Luo, N. Dawes, K. Aberer, G. Barrenetxea, M. Bavay, A. Kansal, K. A. Kumar, S. Nath, M. Parlange, S. Tansley, C. van Ingen, F. Zhao, Y. Zhou, 25th International Conference on Data Engineering (ICDE 2009), Demo paper, March 29 - April 4, 2009, Shanghai, China.
- Query Allocation in Wireless Sensor Networks with Multiple Base Stations S. Xiang, Y. Zhou, H. B. Lim, K.-L. Tan. The 14th International Conference on Database Systems for Advanced Applications (DASFAA 2009), 21 - 23 April 2009, Brisbane, Australia.
- QoS-Oriented Multi-Query Scheduling over Data Streams J. Wu, K.-L. Tan, Y. Zhou. The 14th International Conference on Database Systems for Advanced Applications (DASFAA 2009), 21 - 23 April 2009, Brisbane, Australia.
2008
- Disseminating Streaming Data in a Dynamic Environment: an Adaptive and Cost-Based Approach. Y. Zhou, B. C. Ooi, K.L. Tan. The VLDB Journal, Springer, Vol. 17, No. 6, pp. 1465-1483, Nov 2008.
- Toward Massive Query Optimization in Large-Scale Distributed Stream Systems. Y. Zhou, K. Aberer, K.-L. Tan. ACM/IFIP/USENIX 9th International Middleware Conference (Middleware 2008), Leuven, Belgium, 2008.
- Rethinking the Design of Distributed Stream Processing Systems. Y. Zhou, K. Aberer, A. Salehi, K.-L. Tan. The Fourth International Workshop on Networking Meets Databases (NetDB), co-located with IEEE ICDE 2008 in Cancun, Mexico, 2008.
- Parallel Distributed Processing of Constrained Skyline Queries by Filtering. B. Cui, H. Lu, Q. Xu, L. Chen, Y. Zhou. 24th IEEE International Conference on Data Engineering (ICDE 2008), 2008.
2007
- Similarity-Aware Query Allocation in Sensor Networks with Multiple Base Stations. S. Xiang, H. B. Lim, K.-L. Tan, Y. Zhou. VLDB 2007 Workshop on Data Management for Sensor Networks (DMSN’07), Vienna, Austria, 2007.
- Window-Oblivious Join: A Data-Driven Memory Management Scheme for Stream Join. J. Wu, K.-L. Tan, Y. Zhou. 19th International Conference on Scientific and Statistical Database Management (SSDBM 2007), Banff, Canada, 2007.
- Two-Tier Multiple Query Optimization for Sensor Networks. S. Xiang, H. B. Lim, K.-L. Tan, Y. Zhou. The 27th International Conference on Distributed Computing Systems (ICDCS 2007), Toronto, Canada, 2007.
2006
- Efficient Dynamic Operator Placement in a Locally Distributed Continuous Query System. Y. Zhou, B. C. Ooi, K. L. Tan, J. Wu. 14th International Conference on Cooperative Information Systems (CoopIS 2006), Montpellier, France, 2006.
- Leveraging Distributed Publish/Subscribe Systems for Scalable Stream Query Processing. Y. Zhou, K. L. Tan, F. Yu. VLDB 2006 Workshop on Business Intelligence for the Real-Time Enterprise, Seol, Korea, 2006.
- Adaptive Reorganization of Coherency-Preserving Dissemination Tree for Streaming Data. Y. Zhou, B. C. Ooi, K. L. Tan, F. Yu. International Conference on Data Engineering 2006 (ICDE’2006), Atlanta, USA, 2006.
- Scalable and Adaptable Distributed Stream Processing. Y. Zhou. ICDE 2006 Ph.D. Workshop, Atlanta, USA, 2006.
- PMJoin: Optimizing Distributed Multiway Stream Joins by Stream Partitioning. Y. Zhou, Y. Yan, F. Yu, A. Zhou. The 11th International Conference on Database Systems for Advanced Applications (DASFAA 2006), 2006.
2005
- Dynamic Load Management for Distributed Continuous Query Systems. Y. Zhou, B. C. Ooi, K. L. Tan. International Conference on Data Engineering 2005 (ICDE’2005), Poster Paper, Japan, 2005.
- Optimizing Continuous Multijoin Queries over Distributed Streams. Y. Zhou, Y. Yan, B. C. Ooi, K. L. Tan, A. Zhou. ACM Fourteenth Conference on Information and Knowledge Management (CIKM’2005), Poster Paper, Germany, 2005.
- An Adaptable Distributed Query Processing Architecture. Y. Zhou, B. C. Ooi, K. L. Tan, W. H. Tok. Data & Knowledge Engineering. Elsevier Science Publishing Inc, North-Holland, Vol. 53, No. 3, pp. 283-309, June 2005.
2003
- Adaptive Distributed Query Processing. Y. Zhou. In Proc. of the VLDB 2003 PhD Workshop, Berlin, Germany, September 12-13, 2003. (Selected as one of the top 5 papers for presentation in the VLDB 2003 poster sessions.)