Sponsor Sessions

To meet with colleagues, chat, and hold discussions, SIGMOD provides a virtual conference site in Gather.
10:00 AM – 10:30 AM
ORACLE Research Talk
Data Architect’s Dilemma: Multiple Specialized Databases vs a Converged Autonomous Database
Speaker: Tirthankar Lahiri
The classic dilemma faced by an enterprise data architect is whether to use multiple single-purpose data management solutions for different use cases, or a single data management solution for all use cases. We will present the pros and cons of these approaches, and then show how Oracle’s machine-learning-enhanced Autonomous Database enables a converged data management solution, one that combines the benefits of both approaches without the drawbacks.
Speaker Bio: Tirthankar Lahiri is Senior Vice President of the Data and In-Memory Technologies group for Oracle Database and is responsible for the Oracle Database Engine (including Database In-Memory, Data and Indexes, Space Management, Transactions, and the Database File System), the Oracle TimesTen In-Memory Database, and Oracle NoSQLDB. Tirthankar has 24 years of experience in the database industry and has worked extensively in a variety of areas including Manageability, Performance, Scalability, High Availability, Caching, Distributed Concurrency Control, In-Memory Data Management, and NoSQL architectures. He has 40 issued and several pending patents in these areas. Tirthankar has a B.Tech in Computer Science from the Indian Institute of Technology (Kharagpur) and an MS in Electrical Engineering from Stanford University. He was in the PhD program at Stanford University, and while Oracle Database interrupted his PhD, he had a great time doing research on topics ranging from multi-processor OS kernels to semistructured data management. (Look up “Hive” and “Ozone” at Stanford, if you are interested!)
Zoom Link
12:00 PM – 12:30 PM
IBM Research Talk
Data Research at IBM Research AI
Speakers: Fatma Ozcan and Sameep Mehta, IBM Research AI
In this talk, we will provide an overview of IBM Research and describe some of the recent innovations from IBM Research AI in the areas of data management, AI, data lifecycle, and AI applications.
Zoom Link
1:00 PM – 1:30 PM
Intel Research Talk
Block-Based Computational Storage Using Virtual Objects

Speaker: Ian Adams

Abstract: Interest in computational storage is again on the rise, as are discussions around the I/O models for interacting with computational storage targets. Most prior and current approaches either require a departure from the widespread block protocol to one that is more computationally friendly (e.g., file, object, or key-value), or they introduce significant host-side complexity on top of (or in addition to) the block protocol. We introduce a block-compatible design based on ephemeral virtual objects. Like a real object (e.g., a file), a virtual object contains the metadata that is needed to process the data, but does not require that a storage target maintain additional state like that required with a full object-based drive. We show how numerous offloads are possible using virtual objects and, as one example, demonstrate a 99% reduction in the data movement required to “scrub” object storage for bitrot. We also present our early work with erasure-coded data which, unlike RAID, can be easily adapted to computational storage using virtual objects.

Bio: Ian F. Adams is a storage systems researcher in Intel’s Advanced Storage Research team. Before joining Intel, Ian worked as an engineer at Seagate Technology. His research interests are broadly in storage systems, computational storage, and long-term digital archiving.

Zoom Link
10:00 AM – 10:30 AM
Amazon Recruiting Talk
Performance and Elasticity in Amazon Redshift
Speaker: Ippokratis Pandis, Senior Principal Engineer
Amazon is hosting a virtual presentation with Ippokratis Pandis, Senior Principal Engineer on the DBS Redshift team, on Wednesday, June 17th, from 10:00am-10:30am PST. Join us to see how Amazon Redshift achieves its state-of-the-art performance and learn about all aspects of elasticity, from the compute and data elasticity within a single cluster to elasticity across multiple clusters. If you are interested in attending the virtual presentation, please use the Amazon Chime bridge below, or reach out to the Program Coordinator, Darby Tinker (dtink@amazon.com), for registration details.
Amazon Chime Link
3:30 PM – 4:00 PM
Snowflake Q&A
Live Q&A – Data Driven Engineering at Snowflake
Snowflake is a new data platform built from the ground up for the cloud. It’s designed with a new architecture to be the centerpiece for data pipelines, data warehousing, data lakes, data application development, and for building data exchanges to easily and securely share governed data. The result? A platform delivered as a service that’s powerful but simple to use.

Engineers at Snowflake use our own platform to collect and analyze massive amounts of data from multiple facets of Snowflake services. This enables unprecedented insight into customer workloads and into the technical features and solutions that will benefit those workloads. It aids in deciding which new features will be most desirable and beneficial, and in making the service itself more performant and easier to use. Join us to hear how we leverage data from our own platform to build a better platform.

Zoom Link
10:00 AM – 10:30 AM
Microsoft Demos

Building cloud-native HTAP solutions with Azure Synapse Link for Azure Cosmos DB

In this demo, we will learn more about Azure Synapse Link for Azure Cosmos DB, the first cloud-native HTAP capability that enables running analytics directly over operational data in near real-time with no ETL and no performance impact on mission-critical transactional workloads. We will walk through a demo of how you can now build machine learning models, responsive business intelligence reports, and big data analytics pipelines directly over Azure Cosmos DB, leveraging the power of Apache Spark and SQL runtimes in Azure Synapse Analytics. You can find more details here.

High Performance HTAP with Postgres and Hyperscale (Citus)

In this demo, we will run a large-scale HTAP workload on Azure Database for PostgreSQL with the Hyperscale (Citus) deployment option. Hyperscale (Citus) uses the open source Citus extension to turn a cluster of PostgreSQL servers into a single distributed database that can shard or replicate tables across the servers. Citus can simultaneously scale transaction throughput by routing transactions to the right server, and scale analytical queries and data transformations by parallelizing them across all of the servers.

Zoom Link
12:00 PM – 12:30 PM
Futurewei Research Talk
DFV-Chogori – an open source project on µsec latency distributed transactional memory-centric system
Speaker: Jerry Feng
Presentation Speakers:

  1. Hao Feng – Chief Architect, Cloud Storage Services
  2. Ivan Avarmov – Chief Architect, Chogori Project lead.
  3. Justin Funston – Senior Engineer, Cloud Storage Services
Zoom Link
1:00 PM – 1:30 PM
Facebook Research Talk
Data systems research at Facebook: an overview
Zoom Link
3:00 PM – 3:30 PM
Alibaba Group Product Talk
Alibaba Cloud AnalyticDB: Cloud Warehouse in the Era of Fast Data
Speaker: Dong Yu, Senior Staff Engineer at Alibaba Group
In this talk, we will introduce Alibaba Cloud AnalyticDB, a cloud-native data warehouse that empowers our customers to fully explore and utilize the true value of their business data, especially in the coming era of fast data. We will talk about the challenges, architecture, and practice of AnalyticDB, as well as its customer scenarios.
Zoom Link