YouTube home Comedy Week on YouTube
Upload
Alert icon

Patent on "Long Tail" for automated content authorship.

PhilipMParker PhilipMParker·29 videos
261
111,840

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to like PhilipMParker's video.

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to dislike PhilipMParker's video.

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to add PhilipMParker's video to your playlist.

Uploaded on Sep 16, 2007

Patent on "Long Tail" for automated content authorship.

FAQ

As the video shows, I am working on reference books, reports and educational titles (not fiction or literature).

The "algorithms" depend on the genre. The most advanced use parametric, non-parametric as well as Bayesian econometrics, graph theory, and meta analysis (mostly coupled with some specialized computational linguistics and editorial rules that are required within certain genres) -- each piece is rather straight forward; the combination allows complexity. In terms of IT or programming languages, there is no rigidity to this - again it depends on the genre. If animation is the goal, then code is written to write MEL scripts, etc., which can automate Maya, which can in turn automate rendering, lights, etc., via macros. This works well, but for only certain aspects of that genre.
For more detailed discussions, here is the patent link:

http://www.google.com/patents?id=bHeB...

Some titles are 98 to 100 percent computer automated (e.g. business titles, crosswords, etc.). For health titles, only the format editing and production side is automated. The text in the health books was written by medical professionals and edited by a professional editor; the computer expedited formatting using about 50 odd routines (the preface, chapter intros, glossaries, indexes, headings, margins, etc.); highlights are made to sources generally not known to internet-averse readers or medical practitioners (designed for medical libraries with internet training services).

Currently, some 2 percent of the titles rely on government sources for text. None perform a google search, spider the net, etc. Some 98 percent of the titles are wholly generated via automation programs; the applications create original information or content that cannot be found elsewhere (e.g. maximum likelihood trade estimates, latent demand forecasts via a decision calculus approach, Chinese and English crosswords, etc.) - offline applications with no interaction to the internet. In total, there are about 17 genres created this way (about 200,000 titles or so since 2000).

It can take several years to set up an application (including all human inputs, licensed sound effects, textures, models, mocap, data, or decision rules that go into any genre-specific application). Platforms (e.g. Maya) pre-exist. The incremental, or marginal creation time per title is mentioned in the video.

The genres are blind or peer reviewed and/or vetted by users (e.g. librarians or end-users) before they are put into print. The games are played by kids to see what they like. For 3D games, a pre-existing rendering engine is like a blank word document. The rendering engine is not created from scratch, but licensed (like MS Word).

I am mostly now working on education titles for Asian, African, and Native American languages that do not have educational materials (games, supplements, texts, videos, mobile phone books, etc.) written in or augmented by their languages. See my dictionary at:

http://www.websters-online-dictionary...

to see a very small percent of the linguistic material used. Watch for a major update and linguistic augmentation to the dictionary this summer when I will also be introducing EVE. She is an "economically viable entity". A step beyond a chat bot, using some of the algorithms mentioned above (with a bit of utility theory and optimal control theory thrown in).

There is no "commercial" or "public" or "open source" software that can be used by the general public. Some applications are terabytes large. I am working on a relatively small poetry application for public use -- to be released when completed (probably in a year), which will do several forms of poetry, on any topic the user desires; and allow the user to request "another" if they do not like the first one written, or "change that line", etc.

I am not actively working on fiction novels as a priority, though the process is in place for romance novels or similar formulaic types of literature. Fun to do, but not very useful.

There are many other areas I am working on, as there are multiple avenues to explore, especially in the areas of new media (mobile and fixed), but more so in high-end analytics and knowledge discovery (i.e. generating knowledge that could not be created otherwise) as applied to business, language and public services (e.g. criminology) - where unmanageable, sparse, disintegrated or larger data sets (off-line) result in new knowledge structures usable by decision makers (e.g. connecting the dots where humans have difficulty doing so, for lack of time or expertise).

Thanks for watching the video.
Phil

Loading icon Loading...

Loading icon Loading...

Loading icon Loading...

The interactive transcript could not be loaded.

Loading icon Loading...

Loading icon Loading...

Ratings have been disabled for this video.
Rating is available when the video has been rented.
This feature is not available right now. Please try again later.

Uploader Comments (PhilipMParker)

  • PhilipMParker

    As requested, sample grammatical acrostics, practiced in elementary schools to introduce children to poetry (title is an acronym for words in the poem):

    NUDE

    Naked unclad, dear enactment.

    LOVE

    Lean of vile emotions.

    GOD

    Gentlemen of divinity!

    BOOK

    Bible ordered, obtained Koran.

    Uses graph theory (clique commonality) and over 40,000 grammatical structures, ranked by meta-analytic probabilities of being understood by English readers (see "More info" link above to the right).

    · 2

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate PhilipMParker's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate PhilipMParker's comment.
  • sycomsimon

    Hi Phil, Is it available yet to use in compilation for individual users as i need to gather reports that today takes so much time to gather and read that there is just not enough time, it feels, to accomplish this. Wow your algorithim would be very useful in this.

    Please let me know if there is a beta version users can use? for a fee? or?

    Thanks,

    Simon

    p.s. Very cool mate :-)

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate sycomsimon's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate sycomsimon's comment.
  • PhilipMParker

    Hi,

    No beta available for public use. Good idea though.

    Phil

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate PhilipMParker's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate PhilipMParker's comment.
    in reply to sycomsimon (Show the comment)
  • hgld

    Very interesting and equally controversial. It would be interesting to discuss the copyright issues associated with this sort of publishing.

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate hgld's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate hgld's comment.
  • PhilipMParker

    Hello,

    The applications create original content that have copyrights, it does not produce material that violates existing copyright. If a photo or image or passage is cited, this is done with permissions, as per the publishing industry. Such usage is not innovative in this regard. The patent covers the generation of original material. The link in the FAQ provides more info. Phil

    · 3

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate PhilipMParker's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate PhilipMParker's comment.
    in reply to hgld (Show the comment)
  • TimothyCohn

    Nice work Phil.

    Is this the machine equivalent of Hypergraphia?

    Cybergraphia?

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate TimothyCohn's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate TimothyCohn's comment.
  • PhilipMParker

    Yes, I guess so :>

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate PhilipMParker's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate PhilipMParker's comment.
    in reply to TimothyCohn (Show the comment)

Top Comments

  • kitchnsyncrecords

    OH MY GOD SKYNET!

    · 15

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate kitchnsyncrecords's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate kitchnsyncrecords's comment.
  • Fei-Hong Wong

    wow! we're inventing ourselves out of existence.

    · 9

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Fei-Hong Wong's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Fei-Hong Wong's comment.

All Comments (28)

Sign in now to post a comment!
  • Sergiy Kulibaba

    George Orwell knew it!

    · 2

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Sergiy Kulibaba's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Sergiy Kulibaba's comment.
  • Snowzinger

    The future is nigh...

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Snowzinger's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Snowzinger's comment.
  • Paperplatesclothing

    how do i invest?

    · 2

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Paperplatesclothing's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Paperplatesclothing's comment.
  • iiTudy

    WTF? This just pwned in 10 minutes, 5000 years of book writing and authors.

    · 2

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate iiTudy's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate iiTudy's comment.
  • Yaroonn

    This is so friggin' scary.

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Yaroonn's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Yaroonn's comment.
  • feltfirefox

    Interesting stuff!

    Parallel text editions of classic works, with attendant automatically generated mp3 readings would be of great use to language learners (cf the 'listening/reading method') - they can take a long time to put together by hand. I suppose that relevant grammatical notes could also be added automatically. There would be a sizeable market for these.

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate feltfirefox's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate feltfirefox's comment.
  • arbitterm

    You could make million by selling exclusively to High School and College students.

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate arbitterm's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate arbitterm's comment.
  • Adam Nunn

    Could this program realistically create an essay or document on command? for example, If I wanted a 2 page report on the life of Napoleon, and had a database with hundreds of pages on napoleon, could it pick out important events, and compile them? or could it create a scientific lab writeup when given experimental results and a problem?

    If so, this is the greatest creation I have heard of, if not, I hope it will someday lead to this.

    ·

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Adam Nunn's comment.

    Sign in to YouTube

    Sign in with your YouTube Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to rate Adam Nunn's comment.
  • Loading comment...
Loading...
Loading...
Working...
Sign in to add this to Watch Later