Google Tech Talk
November 5, 2009
ABSTRACT
Presented by Steven R. Bagley & David F. Brailsford, School of Computer Science, University of Nottingham, NOTTINGHAM NG8 1BB , UK
Computer document processing often starts with an abstract, structural, representation before entering a processing pipeline which creates a desired layout and appearance. But unfortunately the whole system resembles a series of steps in a one-way chemical reaction, or the successive irreversible stages of creating assembler code using a compiler.
This `one-way function' behaviour is most obvious with PDF, which is tied to a completely fixed appearance once a document passes through a one-way 'trapdoor' like Adobe Distiller. Some formats, such as XHTML, allow for a little more wriggle room but even this breaks down if the appearance changes dramatically (such as displaying a Web page on a large monitor). In essence, any attempt to reflow a document, or view it at some other size, is either frustrating, or simply impossible, without regenerating the document from a more abstract, higher-level representation.
This limitation has not had much effect over the past 25 years, but it is now hitting us hard. In a world of iPhones, eBook Readers, 10" netbooks, laptops, 30" Cinema Displays -- and not forgetting the humble printed page -- it is no longer safe to assume that a document will be viewed in one fixed presentation. `Repurposing' (without the need for
total re-processing) needs to be the watchword for a modern document format. However, this leads us to the heart of the problem: current formats don't lend themselves to having their presentational properties partially unpicked and re-engineered.
In this talk, we outline the current state of the art in document formats, and their limitations when it comes to repurposing. We describe our attempts at making PDF be a more repurposable format and we outline some necessary features, and open questions, for future document formats.
Interesting talk. I'm curious why they don't begin with LaTeX and try to make a stripped down version which might render quicker. Or reflow the PDFs before downloading it to the ebook reader. i.e. \documentclass[ebook]{book}.
fnord321 2 years ago
Nice, but I think I might've heard this somewhere before.
ajp06u 2 years ago
When you asked a question about "who will put the heavy weights", I thought it could be an external service. If you download a document and want to read it on a eBook reader, you not just download it, but get it passed through a "compilation" for your particular device.
detence 2 years ago
Not all endeavors need to be about profiteering when the inherent focus is on seamless capabilities within the realm of data/information sharing; a benevolent platform is laid for endless possibilities to capitalize on. The open source culture can serve as a relevant precedent.
Googletecktalks, Thanks for sharing!
Kindest Regards,
RealFreedom
RealFreedom4Real 2 years ago 2
You have to be an Apple customer to be so vacuous.
lennyhome 2 years ago