Cloud Computing and Digital Libraries First perspectives on a future technological alliance
Pedro Lopes | pedrolopes@ua.pt | PDEI - Seminars | Jan 23, 2009
Outline ‣ Internet ‣ Cloud-Computing ‣ Digital Libraries Issues • Services • Digital Data Storage
‣ Cloud Libraries
Internet
[http://xkcd.com/530]
Cloud Computing [Motivation] ‣ Internet has changed, it is “the” platform for everything • Accessible • Anywhere • Anytime • Everyone
How do novel applications work • Applications run in the browser within the cloud? • Entertainment: Hi5, Facebook, IMDb...
• Productivity: Office Live, Google Apps...
• “Everything”-as-a-service • Internet is the Cloud
[http://infreemation.net/wp-content/uploads/cloud-computing-kitchen-sink.jpg]
Cloud Computing [Architecture] ‣ GRID > Virtualization > Cloud
[http://www.slideshare.net/ruv/meshu-cloud-camp]
Cloud Computing [Benefits] ‣ Resource flexibility • Improve service level and availability
‣ Pay for what you use • Cost saving, leveraging economies of scale
‣ Rapid prototyping and market testing • Increased speed to market
‣ Self-service deployment ‣ Reduce lock-in and switching costs
Digital Libraries ‣ Store information in the digital world • Digital knowledge
‣ Make it available online
Get the data IN
• With some restrictions
‣ Issues • Lack of services / functionalities
Data Storage & Infrastructures
• Difficulties digital data storage
Get the data OUT
Services [Problem] ‣ There’s an enormous amount of data • But there’s a lack in advanced functionalities
‣ What can we do with the data? • Search • View
‣ What new services? • Information mining • Advanced queries • Semantics
Services [Solution] ‣ Services can be easily created in the correct frameworks ‣ Cloud-Computing • Built-in framework • Easy development of new services • Increases momentum in the developer community
• Faster testing and deployment • More processing power
Digital Data Storage [Problem] ‣ Digital data • Preservation • Component failures • Obsolescence • Attacks • Natural disasters
• Distribution • Availability • Heterogeneity • Scalability • Authenticity
Digital Data Storage [Solution] ‣ Preservation issues may be solved with distributed environments ‣ Cloud-computing • Built-in architecture • Distributed • Every single data bit is replicated • Metadata ‣ Geographic distribution ‣ Scalability
• Better availability is assured
Cloud Libraries [Conclusions] ‣ Information is the most important component ‣ “We shouldn’t care about anything else” • Leave that to the cloud-computing environment • Service development • Distributed architecture • Metadata
‣ Digital libraries and cloud-computing are a perfect match • Create a digital library based on a cloud computing operating environment • Microsoft Azure Services • Amazon Elastic Compute Cloud • Google App Engine
Questions?
Thank You