Visual Perception with Deep Learning
Sign in to YouTube
Sign in to YouTube
Sign in to YouTube
Uploaded on Apr 10, 2008
Google Tech Talks
April, 9 2008
ABSTRACT
A long-term goal of Machine Learning research is to solve highy
complex "intelligent" tasks, such as visual perception auditory
perception, and language understanding. To reach that goal, the ML
community must solve two problems: the Deep Learning Problem, and the
Partition Function Problem.
There is considerable theoretical and empirical evidence that complex
tasks, such as invariant object recognition in vision, require "deep"
architectures, composed of multiple layers of trainable non-linear
modules. The Deep Learning Problem is related to the difficulty of
training such deep architectures.
Several methods have recently been proposed to train (or pre-train)
deep architectures in an unsupervised fashion. Each layer of the deep
architecture is composed of an encoder which computes a feature vector
from the input, and a decoder which reconstructs the input from the
features. A large number of such layers can be stacked and trained
sequentially, thereby learning a deep hierarchy of features with
increasing levels of abstraction. The training of each layer can be
seen as shaping an energy landscape with low valleys around the
training samples and high plateaus everywhere else. Forming these
high plateaus constitute the so-called Partition Function problem.
A particular class of methods for deep energy-based unsupervised
learning will be described that solves the Partition Function problem
by imposing sparsity constraints on the features. The method can learn
multiple levels of sparse and overcomplete representations of
data. When applied to natural image patches, the method produces
hierarchies of filters similar to those found in the mammalian visual
cortex.
An application to category-level object recognition with invariance to
pose and illumination will be described (with a live demo). Another
application to vision-based navigation for off-road mobile robots will
be described (with videos). The system autonomously learns to
discriminate obstacles from traversable areas at long range.
This is joint work with Y-Lan Boureau, Sumit Chopra, Raia Hadsell,
Fu-Jie Huang, Koray Kavakcuoglu, and Marc'Aurelio Ranzato.
Speaker: Yann Le Cun
Computational and Biological Learning Lab,
Courant Institute of Mathematical Sciences,
New York University.
-
Category
-
License
Standard YouTube License
Loading...
Loading...
Loading...
Loading...
Loading...
-
1:23:24
Brain Mind and Behavior: Defining the Mindby UCtelevisionFeatured
39,939
-
1:29:54
Think faster focus better and remember moreRewiring our brain to stay younger...by GoogleTechTalks
400,956 views
-
1:08:14
The Wires that Control the Public Mindby GoogleTechTalks
33,198 views
-
59:23
The Next Generation of Neural Networksby GoogleTechTalks
275,881 views
-
59:43
Managing Stress and Psychosomatic Disorders Through Meditationby GoogleTechTalks
47,045 views
-
56:46
The Quest for Consciousness: A Neurobiological Approachby UCtelevision
23,390 views
-
58:08
No Time to Thinkby GoogleTechTalks
131,286 views
-
52:03
Richard O. Brown: The Neuroscience of Nothingby UCtelevision
31,546 views
-
9
videos
Play all
boltzmann machinesby mac01021
-
58:26
What We See with Fred Dretskeby UCtelevision
6,062 views
-
48:10
Bay Area Vision Meeting: Unsupervised Feature Learning and Deep Learningby GoogleTechTalks
53,375 views
-
54:16
Neurodevelopmental Disorders: Perception of Emotionby UCtelevision
5,543 views
-
58:38
Learning 3D Models from a Single Still Imageby GoogleTechTalks
26,904 views
-
51:58
Learning from StackOverflow.comby GoogleTechTalks
101,722 views
-
1:01:12
Stress Relief for the Creative and Constantly Connectedby GoogleTechTalks
23,240 views
-
1:05:10
How Ant Colonies Get Things Doneby GoogleTechTalks
68,910 views
-
54:35
Personal Growth Series: Karl Deisseroth on Cracking the Neural Code: Speaking...by GoogleTechTalks
24,210 views
-
2:16
The Deep Learning Sagaby Yoshua Bengio
3,765 views
-
1:08:40
Lecture 1 | Machine Learning (Stanford)by StanfordUniversity
441,082 views
-
59:19
Making Great Decisionsby GoogleTechTalks
28,873 views
-
6:44
Visual Perception, Part 1by Robert Shapiro
792 views
- Loading more suggestions...
All Comments (18)
UniverseApproved 10 months ago
does anyone remember what it was like playing "I spy with my little eye" when they were little?
Sign in to YouTube
Sign in to YouTube
UniverseApproved 11 months ago
Seems an AI-complete computer vision system capable of such high resolution and presicion in it's relative-informational simulations would be highly relevant to controlling finance and governance. This enhanced ability to monitor and manipulate the matter on our planet seems part of a profound process to me.
I was just being silly in my original post, I mean you must have read Baudrillard's On Simulacra and Simulation, eh?
Sign in to YouTube
Sign in to YouTube
UniverseApproved 11 months ago
Consider AI-complete computer vision used as better, more efficient Vingean "localizers". Imagine a system like us (AI complete) but that has perfect memory, can "see" (and to a high degree of accuracy predict) the "numbers" between distinct objects within the visual input from it's physical environment, can see in other than visible light, and that operates millions of times faster than a human being.
Sign in to YouTube
Sign in to YouTube
UniverseApproved 11 months ago
wissner-gross talks about this in his "on planetary-scale intelligence" talk...if you believe that AI is likely to be produced through an economic process, then AI is highly likely to arise in the quantitative finance or quantitative advertising markets. Consider the high speed algorithms involved; better modeling of human behavior allows more money to be made, you know, sifting through entropy uncovering and exploiting hidden information.
Sign in to YouTube
Sign in to YouTube
3jrhall 11 months ago
If you know of something better than Restricted Boltzmann Machines for machine learning, I'd love to hear about it. And what does machine learning have to do with controlling finance and government? That's controlled by human greed, not weighted logic.
Sign in to YouTube
Sign in to YouTube
UniverseApproved 1 year ago
why are all you smart people allowing outdated governance and finance models to rule our realities? Don't you wish for shared freedom? I mean, this is some sad ass modeling.
Sign in to YouTube
Sign in to YouTube
CassandraAbbey 1 year ago
Blasphemy? how come?
Sign in to YouTube
Sign in to YouTube
adelle0001 1 year ago
Good job! ang ganda ng video!
Sign in to YouTube
Sign in to YouTube
Steve Grimm 2 years ago
Does anyone else think the intro guy looks like a young Jeff Goldblum?? ;)
Sign in to YouTube
Sign in to YouTube