Google Developers Day US - Theorizing from Data
Sign in to YouTube
Sign in to YouTube
Sign in to YouTube
Uploaded on Jun 5, 2007
"Theorizing from Data: Avoiding the Capital Mistake
Peter Norvig
""It is a capital mistake to theorize before one has data."" Sir Arthur Conan Doyle's words from 1891 remain true today. Researchers in computational linguistics and information retrieval now have a million times more data than was available 30 years ago. This talk explores what this data can do for problems in language understanding, translation, information extraction, and inference, and extrapolates to what more data may bring in the future. "
-
Category
-
License
Standard YouTube License
Loading...
Loading...
Loading...
Loading...
Loading...
-
4:24
Google Headquarter - Amazing Work Place 9/19/07by Edward ChongFeatured
447,225
-
47:58
Google Developers Day US - Intro to Data APIsby GoogleDeveloperDay
34,933 views
-
48:44
Google Developer Day US - Fast, Easy, Beautiful: GWTby GoogleDeveloperDay
237,589 views
-
47:20
Google Developers Day US - Maps API Introductionby GoogleDeveloperDay
238,173 views
-
44:02
Google Developers Day US - Python Design Patternsby GoogleDeveloperDay
50,241 views
-
45:33
Peter Norvig of Google on The History and Future of Technological Changeby singularitysummit
1,457 views
-
23:35
The Future of Search, Peter Norvig (Google)by CITRIS-UC.ORG
3,918 views
-
4:54
Peter Norvig's Gettysburg Address Powerpointby MethodContent
6,259 views
-
51:14
Math Encounters -- ABC Easy as 123 -- Peter Norvig (Presentation)by MuseumOfMathematics
2,211 views
-
20:35
Hans Rosling: Stats that reshape your world-viewby TEDTalks
932,439 views
-
1:02:56
Peter Norvig - The Unreasonable Effectiveness of Databy UBC Computer Science
19,201 views
-
55:16
Google I/O 2009 - The Myth of the Genius Programmerby Google Developers
223,549 views
-
1:02:25
TODAY: Innovation in Search and Artificial Intelligenceby CITRIS-UC.ORG
10,397 views
-
21:27
David McCandless: The beauty of data visualizationby TEDtalksDirector
255,918 views
-
1:08:40
Lecture 1 | Machine Learning (Stanford)by Stanford University
442,800 views
-
25:59
Peter Norvig at Startup School 08by startupschool
3,424 views
-
45:54
Reinventing Education with Khan Academy and AI Classby Google+ Hangouts
28,084 views
-
16:51
Tim Berners-Lee: The next Web of open, linked databy TEDTalks
95,800 views
-
18:59
Scaling Data: Postgres, The Stack and the Future of Replicationby linuxconfau2012
1,586 views
-
6:12
Peter Norvig: The 100,000-student classroomby TEDtalksDirector
33,037 views
- Loading more suggestions...
Top Comments
pixiemotion 5 years ago
The basic method of a probabilistic translation model and a language model is relatively old news (Brown et al, 1990), and the same criticisms that applied 17 years ago have not been answered here: what do you do with language pairs that differ?
Now, if they manage to translate English-Klingon, that'd be impressive.
Sign in to YouTube
Sign in to YouTube
pixiemotion 5 years ago
Very interesting overview, but the question session in the end revealed a rather low competence among the audience, which is too bad -- there are some much more interesting theoretical questions to be asked. For one, this type of machine translation seems to be founded on having some sort of parallell aligned texts; this is relatively easy for German and English as showed in the examples, they're very similar languages both syntactically and lexically.
Sign in to YouTube
Sign in to YouTube
All Comments (16)
Shreeraj Jadhav 1 month ago
video quality sucks! Can't read slides.
Sign in to YouTube
Sign in to YouTube
spleenblender 4 years ago
Pattern recognition pattern recognizer.
Godel is laughing somewhere.
Sign in to YouTube
Sign in to YouTube
WetlandsRemediation 4 years ago
The most accurate translation is not always the best.
Idiom, by definition, has no translation.
Sign in to YouTube
Sign in to YouTube
54spiritedwill54 5 years ago
Quite interesting...
Sign in to YouTube
Sign in to YouTube
LethalCoke 5 years ago
He mentioned a DVD that Google sold which had their collection of English words. Anyone know how to obtain it?
Any help will be VERY appreciated^^
Sign in to YouTube
Sign in to YouTube
Erudecorp 5 years ago
You would put that into the search criteria, and have it search words within words (synthetic) or context (analytic). English is a mix of synthetic and analytic already, so you can see it already has those capabilities.
Sign in to YouTube
Sign in to YouTube
pixiemotion 5 years ago
Ut what happens when you try aligning eg. polysynthetic languages such as the Greenlandics (where a single word may express what in English would be a ten letter sentence) and analytic languages such a s Chinese (where the average word length is, what, 2.5 letters?). There are a lot of challenges to be met, and it'd be very interesting to see how Norvig and the Google MT team are dealing with them.
Sign in to YouTube
Sign in to YouTube
PGTaboada 5 years ago
IMHO one of the best sessions I have seen from GDD US.
Sign in to YouTube
Sign in to YouTube