IBM Research at SIGMOD 2020

Share this post:

ACM SIGMOD/PODS 2020 like many other events impacted by COVID-19 pandemic will be taking place virtually from June 14 through June 19. The focus of work at SIGMOD 2020 ranges from adding graph querying to relational databases, to natural language interfaces to data, to operationalizing data for new AI workloads. Results to be presented includes work done at our IBM Research-Almaden and IBM Research-India labs, as well as by our summer interns from universities and our partners in other IBM units.

IBM is a proud Platinum Sponsor at this year’s conference. Register for free to view the full program at SIGMOD 2020. You can also access our virtual booth, where you can learn more about our work, as well as see and hear about our latest technology demos, and publications, including:

  • Data Quality Assessment and Improvements: AI Infused System to evaluate the quality of the data, discover “bad” areas and methods to improve the data quality
  • Supporting Synergistic and Retrofittable Graph Queries using IBM Db2 Graph
  • An Ontology-Based Conversation System for Knowledge Bases
  • Enabling Rich Queries over Heterogeneous Data from Diverse Sources in HealthCare

And if you would like to know more about IBM Research career opportunities, including the AI Residency Program, tell us more about yourself.

SIGMOD/PODS 2020 Tutorial & Panel

Tutorial: State of the Art and Open Challenges in NL Interfaces to Data
Fatma Ozcan (IBM Research – Almaden), Abdul Quamar (IBM Research – Almaden), Jaydeep Sen (IBM Research – India), Chuan Lei (IBM Research – Almaden), Vasilis Efthymiou (IBM Research – Almaden)

Tuesday, June 16
1:30 – 3:00 PM US Pacific

Panel: Startups Founded by Database Researchers
Moderator: C Mohan (IBM Research)

The panelists will discuss the trials and tribulations of their entrepreneurial efforts, what worked and what did not, and cover other topics on startups, spanning early-stage to successfully exited ones, both in the USA and Europe.

Wednesday, June 17
10:30 AM – 12:00 PM US Pacific

SIGMOD/PODS 2020 Papers

A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching
Venkata Vamsikrishna Meduri (Arizona State University), Lucian Popa (IBM Research, Almaden), Prithviraj Sen (IBM Research, Almaden), Mohamed Sarwat (Arizona State University)

Data matching session
Wednesday, June 17
10:30 AM – 12:00 PM US Pacific

IBM Db2 Graph: Supporting Synergistic and Retrofittable Graph Queries Inside IBM Db2
The demo of this work will be available at the IBM Sponsor Booth
Yuanyuan Tian (IBM Research), En Liang Xu (IBM Research), Wei Zhao (IBM Research), Mir Hamid Pirahesh (IBM Research), Sui Jun Tong (IBM Research), Wen Sun (IBM Research), Thomas Kolanko (IBM Cloud and Cognitive Software), Md. Shahidul Haque Apu (IBM Cloud and Cognitive Software), Huijuan Peng (IBM Cloud and Cognitive Software)

Graph DBs and KBs session
Wednesday, June 17
4:30 – 6:00 PM US Pacific

An Ontology-Based Conversation System for Knowledge Bases
The demo of this work will be available at the IBM Sponsor Booth
Abdul Quamar (IBM Research – Almaden), Chuan Lei (IBM Research), Dorian Miller (IBM Watson Health), Fatma Ozcan (IBM Research), Jeffrey Kreulen (IBM Watson Health), Robert Moore (IBM Research), Vasilis Efthymiou (IBM Research)

Graph DBs and KBs” session
Wednesday, June 17
4:30 – 6:00 PM US Pacific

SIGMOD/PODS 2020 Workshops

IBM researchers are also co-organizing workshops on Sunday, June 14, and Friday, June 19.

DSMM: International Workshop on Data Science for Macro-modeling with Financial and Economic Datasets (half day)  
Doug Burdick (IBM Research), Jay Pujara (USC ISI)

DSMM 2020 will explore the challenges of macro-modeling with financial and socio-economic datasets. The workshop will also showcase the Financial Entity Identification and Information Integration (FEIII) Challenge and will involve a challenge task over small business data. The DSMM workshop will explore technical challenges relevant to Business Open Knowledge Network (BOKN), which includes combining state-of-the-art computational approaches for extracting, representing, linking, and analyzing data with complex and nuanced knowledge about the business domain.

Sunday, June 14
8:00 AM – 12:00 PM US Pacific

AIDM: International Workshop on Exploiting AI Techniques for Data Management (full day)
Rajesh Bordawekar (IBM Research), Oded Shmueli (Computer Science Department, Technion), Tin Kam Ho (IBM Watson AI), Nesime Tatbul (Intel Labs and MIT)

AIDM is a one-day workshop that will bring together people from academia and industry to discuss various ways of integrating AI techniques with data management systems. The primary goal of the workshop is to explore opportunities for using AI techniques in enhancing various components of data management systems, such as user interfaces, tooling, performance optimization, support for new query types and workloads. Special emphasis will be given to transparent exploitation of AI techniques using existing data management infrastructures for enterprise-class workloads. We hope this workshop will identify important areas of research and spur new efforts in this emerging field.

Friday, June 19
9:00 AM – 5:00 PM US Pacific

Inventing What’s Next.

Stay up to date with the latest announcements, research, and events from IBM Research through our newsletter.


Distinguished Research Staff Member, Senior Manager-Hybrid Data, IBM Research

Sameep Mehta

Senior Technical Staff Member & Senior Manager, Knowledge Management & Engineering, IBM Research

More AI stories

MIT and IBM announce ThreeDWorld Transport Challenge for physically realistic Embodied AI

MIT Brain and Cognitive Sciences, in collaboration with the MIT-IBM Watson AI Lab, has developed a new Embodied AI benchmark, the ThreeDWorld Transport Challenge, which aims to measure an Embodied AI agent’s ability to change the states of multiple objects to accomplish a complex task, performed within a photo- and physically realistic virtual environment.

Continue reading

Mimicking the brain: Deep learning meets vector-symbolic AI

To better simulate how the human brain makes decisions, we’ve combined the strengths of symbolic AI and neural networks. Specifically, we combined the learning representations that neural networks create with the symbol-like entities represented by high-dimensional and distributed vectors. The idea is to guide a neural network to represent unrelated objects with dissimilar high-dimensional vectors.

Continue reading

Austin or Boston? Making artificial speech more expressive, natural, and controllable

We've developed speech synthesis technology that emulates the type of expressiveness humans naturally deploy in face-to-face communication. In our recent paper Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis presented at the IEEE Spoken Language Technologies Workshop in Shenzhen, China, we describe a system that can emphasize or highlight certain words to improve the expressiveness of a sentence or help with context ambiguity.

Continue reading