It is our great pleasure to welcome you to the Fourth International Workshop on Cloud Data Management (CloudDB 2012). This year we continue our tradition of serving as a premier forum for researchers and practitioners to present research results and share ideas and progress in the area of data management within cloud computing infrastructure. This broad area includes work in distributed storage, parallel algorithms, data mining, serving and analytic workloads, privacy, security, green computing, social workloads, and many others.
The call for papers attracted a wide range of submissions. The program committee accepted 7 papers on a variety of topics. In addition, the program includes 4 keynote speakers from leading cloud computing researchers: Dr. Carlo Curino, Dr. Mohamed Sharaf, Prof. Geoffrey Fox and Prof. Ashwin Machanavajjhala. We hope these proceedings will serve as a valuable resource to learn about the latest and most exciting work in cloud computing.
Proceeding Downloads
Facilitating real-time graph mining
Real-time data processing is increasingly gaining momentum as the preferred method for analytical applications. Many of these applications are built on top of large graphs with hundreds of millions of vertices and edges. A fundamental requirement for ...
Towards non-intrusive elastic query processing in the cloud
- Ticiana L. C. da Silva,
- Mário A. Nascimento,
- José Antônio F. Macêdo,
- Flávio R. C. Sousa,
- Javam C. Machado
Cloud computing is a very promising paradigm of service-oriented computing. One major benefit of cloud computing its elasticity, i.e., the system's capacity to provide and remove resources automatically at runtime. For that, it is essential to design ...
Benchmarking OLTP/web databases in the cloud: the OLTP-bench framework
Benchmarking is a key activity in building and tuning data management systems, but the lack of reference workloads and a common platform makes it a time consuming and painful task. The need for such a tool is heightened with the advent of cloud ...
Large scale data analytics on clouds
We summarize important overall issues affecting use of clouds to support Data Science. We describe the mapping of different applications to HPCC and Cloud systems and the architecture that support data analytics that is interoperable between these ...
Differentially private top-k query over MapReduce
Discovering that Map-Reduce framework is a popular way to deal with a large scale of data, but there is a significant risk to leak out users' personal information, especially when the data is sensitive, for example, including users' health records, ...
The Yahoo!: cloud datastore load balancer
- Markus Klems,
- Adam Silberstein,
- Jianjun Chen,
- Masood Mortazavi,
- Sahaya Andrews Albert,
- P.P.S. Narayan,
- Adwait Tumbde,
- Brian Cooper
Sherpa is a large-scale distributed and globally replicated multi-tenant cloud data storage system. Sherpa scales by horizontally partitioning data into tablets and distributing these tablets across multiple servers. While Sherpa scales for increasing ...
Challenges in enabling social application at scale: cloudDB'12 invited-keynote talk abstract
Internet users spend billions of minutes per month on social networking sites like Facebook, LinkedIn and Twitter. Not only do they create tons of data everyday in the form of posts, tweets and photos, the connections between users have given rise to ...
Cloud computing for environment-friendly data centers
The purpose of this paper is to analyze the carbon footprint and utilization rates in a data center. The long-term goal of this work is to give data center administrators an enhanced perspective of data center operations to allow for more energy ...
Finding the silver lining for data freshness on the cloud: [extended abstract]
Emerging NoSQL key-value data stores rely on data partitioning and replication to achieve higher levels of availability and scalability. Such design choices typically exhibit a tradeoff in which data freshness is sacrificed in favor of reduced access ...
HEDC: a histogram estimator for data in the cloud
With increasing popularity of cloud based data management, improving the performance of queries in the cloud is an urgent issue to solve. Summary of data distribution and statistical information has been commonly used in traditional database to support ...
A security aware stream data processing scheme on the cloud and its efficient execution methods
In order to process a few thousands of streams in real-time, public clouds seem to be an excellent choice since it supports massively amount of computing resources. A public cloud may be managed by a third party and outside the firewall of the ...
Recommendations
Acceptance Rates
Year | Submitted | Accepted | Rate |
---|---|---|---|
CloudDB '13 | 6 | 4 | 67% |
CloudDB '09 | 11 | 8 | 73% |
Overall | 17 | 12 | 71% |