This wiki service has now been shut down and archived

Database Paradigms

From ESIWiki

Jump to: navigation, search

Return to Workshop wiki Main Page

Data-Intensive Research: Data Analysis: Soliciting comments

Please add any comment you would like to make about this theme after the organiser's, Stratis Viglas's, introduction below. It can be specifically related to a talk or breakout session or be general. Please separate entries with headings (two = signs) that flag new topics, or subheadings (three = signs) and add your signature.

Database Paradigms

Over the years, database management systems (DBMSs) have been considered as the standard for data-intensive applications. Relational DBMSs have been able to scale to terabytes of data either for online or offline operations. However, their adoption from the scientific community has been far from universal. A lot of data-intensive problems do not use relational database technology for their processing needs -- sometimes not even as simple backing storage. Rather, they either rely on the file system, or build custom systems from scratch.

The purpose of this stream in the workshop is to identify the different modes of operation traditionally provided by a DBMS and the modes of operation non-DBMS solutions provide. In the context of the stream we will deal with the following topics in more detail:

  • Parallel databases as a way to massively parallelise the data flow of DBMSs.
  • Data warehouses and the need for operations that are different from the standard constructs SQL provides.
  • Vertical partitioning as a storage and processing alternative to alleviate the problems of executing query evaluation code on contemporary processors.
  • Scientific databases as an insight into what types of operation standard database technology might not be able to capture.
  • If SQL is not the correct answer for some of our data needs, is NoSQL the way forward?
  • MapReduce as a different programming paradigm for accessing and manipulating large amounts of data.
  • The need for horizontal and vertical scaling of a database.

Obtaining a grasp on these concepts will further our insight into what constitutes data-intensive research and what sort of functionality we would like to have in the future.

--Sviglas 16:27, 8 March 2010 (UTC)

Add New topics of discussion about this theme like this


Sub-topics like this

and sign your additions like this with the signing button above --MalcolmAtkinson 20:54, 8 March 2010 (UTC)

This is an archived website, preserved and hosted by the School of Physics and Astronomy at the University of Edinburgh. The School of Physics and Astronomy takes no responsibility for the content, accuracy or freshness of this website. Please email webmaster [at] ph [dot] ed [dot] ac [dot] uk for enquiries about this archive.