OpenData is a radically different approach to database creation and
management. Learn how OpenData can help ensure timely, accurate, cost-effective creation of databases. Explore the mechanisms necessary for unadministered, collaborative database management, including techniques to ensure accuracy and data integrity.
The Open Source movement has transformed software development, yet most data remains under tight control. Few tools exist for collaborative creation and maintenance of databases. Contrary to the assumption that gatekeepers ensure the integrity of data, most existing databases are woefully inaccurate, outdated, and expensive to maintain.
Bruce Epstein proposes a radically different way of thinking about database creation and data management. He believes that data categorization happens best when users categorize data while the database evolves. He posits that screen-scraping and intelligent agents will fail to outperform users' ability to lend meaning and context to data. Using appropriate administrative tools and mechanisms, it is possible to collaboratively create large, accurate databases in a timely and cost-effective manner. Potential uses include bug databases, knowledge bases, product matrixes, and many other types of databases. "OpenData" does for data what "OpenSource" does for source code. It allows users to own and help create the data that they need to access.
Epstein will demonstrate a working
OpenData database (a.k.a. "metabase") and discuss the lessons learned from
early deployment tests. The browser-based software is developed using open
source technologies (MySQL and Perl) and does not require any client-side
software beyond the browser itself.