
Google's Settlement with the Publishing Industry: Opportunities and Strategies for Publishers
Bill Rosenblatt
GiantSteps Media Technology Strategies

Google Book Search
• Introduced (as Google Print) in October 2004
• Scanning and indexing millions of books for search and discovery online
  – About 20 libraries, mostly university, scanning from their collections
• Google's use of content
  – Display snippets in search results
  – Link to places to purchase
  – Public domain books: download PDF

The Lawsuit
• Suits filed by Authors Guild and AAP in Sept-Oct 2005
• Allegations of copyright infringement
  – Scanning and copying without authorization
  – "Free riding" on content by monetizing traffic
• Claims of Fair Use
  – Key principles: nature of use, effect of use on market value for work


The Settlement
• Agreement reached in October 2008
  – 141-page main document
  – Plus 13 Attachments
• Final approval set for June 2009 or later


The Settlement
• Google pays ~$30 Million to establish Book Rights Registry
• Google and publishers participate in initial set of business models
• List of potential future business models


Book Rights Registry
• Online database of books and information about their ownership
  – Like a rights collecting society for book content
• Ability to process royalty payments from online content sales and send them to proper party
• Independent
  – Not just for Google
  – Any service provider can use the service
• Google pays >$30 Million to establish BRR


Settlement Architecture
[Diagram. Labels: Content; Publishers; Metadata, Rights, Payments; Metadata & Rights; Payments; Book Rights Registry; Libraries]



Settlement Architecture
[Diagram. Labels: Content; Publishers; Metadata & Rights; Payments; Metadata, Rights, Payments; Book Rights Registry; Service Providers; Libraries]

© 2009



Settlement Business Models
• Sales of online books
  – Like Amazon Pages, online viewer
  – Restrictions on copy & paste, print
  – Free previews depending on type of book
  – Google gets 30% revenue
• Contextual ad sales
  – Publishers get 30% of revenue


Future Business Models
• Print on demand
• PDF downloads
• Custom Publishing
• Consumer subscriptions
• Summaries, abstracts, and compilations


Future Business Models
[Callout: Page Image]
• Print on demand
• PDF downloads
• Custom Publishing
• Consumer subscriptions
• Summaries, abstracts, and compilations


Future Business Models
[Callout: Non-Page Image]
• Print on demand
• PDF downloads
• Custom Publishing
• Consumer subscriptions
• Summaries, abstracts, and compilations


Custom Publishing
• Combining chapters/modules of content into single volumes
• Markets:
  – Higher education, e.g. McGraw-Hill Primis
  – Professional, e.g. O'Reilly/Pearson Safari
• Next challenge: combine content from multiple publishers
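As a rough illustration of combining chapters/modules into a single volume, the sketch below assembles stored XML chapters into one document using Python's standard library. The element names (`volume`, `chapter`, `seq`), the chapter ids, and the `build_custom_volume` helper are all assumptions made for this example; they do not come from the settlement or from any publisher's schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical chapter modules; ids, element names, and text are illustrative only.
CHAPTERS = {
    "bio-101-ch3": '<chapter id="bio-101-ch3"><title>Cell Structure</title>'
                   '<para>Placeholder text.</para></chapter>',
    "chem-200-ch1": '<chapter id="chem-200-ch1"><title>Atoms and Bonds</title>'
                    '<para>Placeholder text.</para></chapter>',
}


def build_custom_volume(title, chapter_ids, library):
    """Combine the selected chapters, in the requested order, into one volume."""
    volume = ET.Element("volume")
    ET.SubElement(volume, "title").text = title
    for seq, chapter_id in enumerate(chapter_ids, start=1):
        chapter = ET.fromstring(library[chapter_id])
        # Record the chapter's position in the new volume.
        chapter.set("seq", str(seq))
        volume.append(chapter)
    return volume


volume = build_custom_volume(
    "Intro Science Course Pack",
    ["chem-200-ch1", "bio-101-ch3"],
    CHAPTERS,
)
print(ET.tostring(volume, encoding="unicode"))
```

Combining content from multiple publishers would additionally need a shared structure and rights information per chapter, which this sketch does not attempt.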


Subscription Services
• Analogous to music services like Rhapsody and Napster, cable TV SVOD
• Existing subscription services charge per document, search, etc.
  – Factiva, LexisNexis, Dialog
• Rightsholder compensation gets tricky for book content


Abstracting and Indexing (A&I)
• Huge legacy of A&I services in many fields
• Customized A&I services can save effort with uniform online access to content and rights info


Publishers Must Be Ready
Common Themes
• Logical Structure
• Metadata
• Rights
• XML Content Architecture


Logical Structure
• Repurposability of content at desired level of detail
• Sequencing information
• Conversion of legacy layout-driven content to XML
• Conversion of editorial processes to XML-first


Metadata
• Adoption of basic metadata in BRR
  – Dublin Core: bibliographic
  – ONIX: supply chain
• Other service providers: specialized metadata
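To make the Dublin Core bullet concrete, here is a minimal sketch of a bibliographic record built with Python's standard library. The namespace URI is the standard Dublin Core element set namespace; the wrapping `record` element, the `dublin_core_record` helper, and all field values are placeholders invented for illustration, and nothing here reflects the format the BRR actually requires.

```python
import xml.etree.ElementTree as ET

# Standard namespace for the 15-element Dublin Core element set.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def dublin_core_record(fields):
    """Build a simple record with one dc:* child element per field."""
    record = ET.Element("record")  # wrapper element name is an assumption
    for name, value in fields.items():
        ET.SubElement(record, f"{{{DC_NS}}}{name}").text = value
    return record


# Placeholder values, not a real book.
book = {
    "title": "An Example Book",
    "creator": "Doe, Jane",
    "publisher": "Example Press",
    "date": "2009",
    "identifier": "urn:isbn:PLACEHOLDER",
    "rights": "All rights reserved",
}

record = dublin_core_record(book)
print(ET.tostring(record, encoding="unicode"))
```

ONIX records carry far more supply-chain detail (pricing, availability, territories) and are not shown here.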


Rights
• Initial business models: implied rights
  – E.g. copy/paste up to 4 pages
  – Print up to 20 pages at a time
  – Free previews according to content type
• Future business models: track rights explicitly
  – Opt in vs. opt out
  – E.g. right to use content in compilations
  – Simple now, extensibility later
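The difference between implied and explicitly tracked rights can be sketched in a few lines. The page limits below are the ones quoted on this slide; the class names, the `is_allowed` and `may_use` helpers, and the `"compilation"` label are hypothetical and only illustrate one simple way a publisher might model the two cases.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ImpliedLimits:
    """Implied rights for the initial business models (figures from the slide)."""
    max_copy_paste_pages: int = 4   # copy/paste up to 4 pages
    max_print_pages: int = 20       # print up to 20 pages at a time


def is_allowed(action, pages, limits=ImpliedLimits()):
    """Return True if a single request stays within the implied limits."""
    if pages < 1:
        return False
    if action == "copy_paste":
        return pages <= limits.max_copy_paste_pages
    if action == "print":
        return pages <= limits.max_print_pages
    return False  # unknown actions are denied by default


@dataclass(frozen=True)
class ExplicitRights:
    """Explicitly tracked rights for future models; opt-in by default."""
    granted: frozenset = frozenset()


def may_use(rights, use):
    """Opt-in check: a use is permitted only if it was explicitly granted."""
    return use in rights.granted


book_rights = ExplicitRights(granted=frozenset({"compilation"}))
print(is_allowed("print", 20), is_allowed("copy_paste", 5))
print(may_use(book_rights, "compilation"), may_use(book_rights, "summary"))
```

Starting with a flat set of granted uses keeps the model simple now, while leaving room to attach terms (territory, dates, pricing) to each grant later.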


Unlock Content™

John Kreisa, Director of Industry Solutions

Preparing for The Google Settlement
• Create a digital strategy for metadata, rights and content
  – XML can be used to store, search and deliver each
• Establish a long term content architecture
  – Create an XML based centralized digital content repository
  – Adopt and leverage XML in production and delivery processes
  – Prepare to go below the page for new business models
• Organize for experimentation with new products
  – Small flexible teams
  – Tolerance for (fast) failure
• Prepare to interface with Google and other service providers
  – Decide on strategy for exposing content to Google
  – Consider options with other providers
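One way to picture "going below the page" is a search that returns sections rather than whole books or page images. The sketch below uses Python's standard library on a small made-up document; the element names, ids, and the `find_sections` helper are assumptions for illustration, and a production repository would more likely use an XML database and XQuery than an in-memory scan.

```python
import xml.etree.ElementTree as ET

# Made-up book structured below the page level (chapters and sections).
BOOK_XML = """
<book id="example-book">
  <chapter id="ch1"><title>Imaging Basics</title>
    <section id="ch1-s1"><title>Modalities</title>
      <para>CT and MRI are compared.</para></section>
    <section id="ch1-s2"><title>Safety</title>
      <para>Dose limits are summarized.</para></section>
  </chapter>
  <chapter id="ch2"><title>Diagnosis</title>
    <section id="ch2-s1"><title>Workflow</title>
      <para>MRI findings guide the next step.</para></section>
  </chapter>
</book>
"""


def find_sections(book, keyword):
    """Return ids of sections whose text contains the keyword (case-insensitive)."""
    keyword = keyword.lower()
    hits = []
    for section in book.iter("section"):
        text = " ".join(section.itertext()).lower()
        if keyword in text:
            hits.append(section.get("id"))
    return hits


book = ET.fromstring(BOOK_XML)
print(find_sections(book, "mri"))
```

Because each hit is an addressable section, the same ids could feed delivery or custom assembly without going back to page layouts.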

Digital Asset Distribution in Practice
[Diagram. Labels: Admin interface (search, load, etc.); IP Mgmt; Search, Get Catalog, Get TOC, Look Inside Book, etc.; Loader; Web services; DAM; Other partners; Digital Asset Delivery/Syndication]

Search Across Transcribed Video and Metadata
• Search through transcripts, analyze and understand word usage

Elsevier: ImagingCONSULT
• A role and task aware application using XQuery
• Utilize structure to provide granular access to information
• Provides an integrated content environment
• Results: physicians work more quickly and with greater assurance
  – App facilitates and guides diagnostic process
  – Greater satisfaction with diagnosis due to improved ability to compare procedures
  – Spend less time looking and more time understanding and healing
Copyright © 2009 Mark Logic Corporation Confidential and Proprietary, Internal Use Only

Role Aware Application: ImagingCONSULT
• Rich navigation and task oriented display leverage tagged information
[Screenshot callouts: Detailed diagnostics; Guided navigation; Summary info]


McGraw-Hill Education – DAL
• Key component of a broader enterprise content management program; tightly integrated with ECM solution
• Strategic, open, and scalable environment for managing, sharing, and processing digital assets and content
• Designed to help archivists, editors, and authors find previously published content in full electronic workflow
• Reduces content acquisition and creation costs. Helps bring new products to market faster.

Digital Assets – Search & Reuse


ECM system

New Products

McGraw-Hill Education Digital Asset Library
Designed to help archivists, editors, and authors find previously published content, the MGH Digital Asset Library is a powerful search and content discovery tool. This library will reduce content acquisition and creation costs and help bring new products to market faster.

Actionable Insight and Collaboration
Business Exchange
• Monitor favorite topics and contributors, see related items

MarkLogic Server: Cornerstone of an All XML Architecture
[Diagram. Labels: Print on Demand; Editorial Staff; Content delivery; Content creation; CMS; Custom Publishing; Customers; DAD; Syndication Partners; Vertical Content Delivery; Content Assembly; Web delivery; Content applications; Content assembly and enrichment]

Architecture for Exploiting the Google Book Settlement
[Diagram. Labels: Content; Publisher (You); Metadata & Rights; Metadata & Rights & Payments; Payments; Libraries; Book Rights Registry]

Selected Mark Logic Customers B2B Magazine

B2C Magazine


Financial/CCF

Legal Tax Regulatory



Unlock Content™

Thank You

