by Jeffrey Pomerantz 45 views
Discussion of the continuum of ages of DLs: young -- maturing -- adult. From The Digital Library: A Biography, by Greenstein and Thorin. For INLS 740, SILS, UNC-CH.
by Chris Pirillo 9,870 views
H.264, Zip, Rar, MJPG are all compression formats. They all offer different modes and compression levels. Knowing when to use each is important.
by smedia1 19,051 views
Walking though the basics of video encoding from codecs and formats to settings to publishing, this tutorial is a primer to the world of encoding, using Sorenson Squeeze. For more encoding tips visit: http://forum.sorensonmedia.com/forum/content.php?200-Top-Enc
To get a free trial of Squeeze 7 visit: http://www.sorensonmedia.com/
by OCRTerminal 21,641 views
http://www.ocrterminal.com/ Optical character recognition works by analyzing your image and identifying characters as accurately as possible. The video gives a brief overview of some Imaging techniques used by popular OCR software. Brought to you by the online OCR service - OCR Terminal.
Transcript of this video:
Optical Character recognition or OCR as it is popularly known, is the process of extracting text from images of documents. These images could have been produced by scanners, digital cameras or even mobile phones and they might contain text based information which you might have to edit. Instead of having to retype all the text, you could use OCR to make things a little easier.
Let me give you an example - I just scanned a document and it looks like this.
You can save it as a .tiff, .png or .jpeg file.
Here are a couple of issues typically associated with scanned images. Before your document is OCR-ed, some preprocessing actions take place:
- Such skewed images typically result from a sheet-fed scanner as opposed to a flatbed scanner. These slighted rotated images will be corrected.
- You might have encountered those lines across your scanned documents that created small breaks, gaps and holes? These will be eliminated during the imaging process.
- There will also be an automatic detection of page orientation.
Other operations such imaging techniques such as "despeckling, noise removal and deinterlacing" are performed so that the OCR engine gets the cleanest possible image.
Next, area where written/printed text is found on your document is marked out and segmented into isolated characters.
The OCR engine then takes over and identifies them.
Modern OCR engines will not only deliver editable and searchable text but also preserve the page formatting such as the different font, colour, columns, figures, tabular data, headers and footers.
At OCR Terminal, we make things really easy -- just upload the image, click yes and your image is converted to your preferred format of .doc, .pdf, .rtf, .txt. So try it for yourself. Sign up for a trial account at www.ocrterminal.com.
by outofmyfolkingmind 11,477 views
The Kirtas APT 1200 in action. This one uses a single 16 megapixel camera and a mirror that flips back and forth to image both sides of the book. The film shows it digitizing a reproduction of an out of print book from the 1870s. The reproduction was created from digitial images generated by the machine. With the high rate of scanning speed, fragile, out of print books can be safely digitized and then reprinted in an "on demand" basis, removing the need for a substantial sized service bureau to complete the work. The quality of the images coming from the camera is superior, guaranteeing that any extremely fragile book need only to be scanned once.
by GoogleTechTalks 284,079 views
Google Tech Talk
May 3, 2012
Presented by Dany Qumsiyeh
See a hardware prototype of an automatic, non-destructive book scanner. The machine turns pages automatically, and captures high-resolution images of each page. It was developed in 20% time, and built in the Google Workshops.
There will be a live demo and a short technical presentation, describing the design process.
Speaker Info: Dany Qumsiyeh is an engineer on the Google Books team.
For more information on this project, see:
by Google 18,992 views
July 26, 2006
Luis von Ahn is an assistant professor in the Computer Science Department at Carnegie Mellon University, where he also received his Ph.D. in 2005. Previously, Luis obtained a B.S. in mathematics from Duke University in 2000. He is the recipient of a Microsoft Research Fellowship. ABSTRACT
Tasks like image recognition are trivial for humans, but continue to challenge even the most sophisticated computer programs. This talk introduces a paradigm for utilizing human processing power to solve problems that computers cannot yet solve. Traditional approaches to solving such problems focus on improving software. I advocate a novel approach: constructively channel human...
by TED 108,846 views
http://www.ted.com 20 years ago, Tim Berners-Lee invented the World Wide Web. For his next project, he's building a web for open, linked data that could do for numbers what the Web did for words, pictures, video: unlock our data and reframe the way we use it together.
TEDTalks is a daily video podcast of the best talks and performances from the TED Conference, where the world's leading thinkers and doers give the talk of their lives in 18 minutes. TED stands for Technology, Entertainment, Design, and TEDTalks cover these topics as well as science, business, development and the arts. Watch the Top 10 TEDTalks on TED.com, at http://www.ted.com/index.php/talks/top10
Follow us on Twitter
Checkout our Facebook page for TED exclusives
by TED 29,419 views
http://www.ted.com Brewster Kahle is building a truly huge digital library -- every book ever published, every movie ever released, all the strata of web history ... It's all free to the public -- unless someone else gets to it first.
by Jas A 14,215,648 views
Not made by me,found it online,available for download on website
Disney Parody explanation of Copyright Law and Fair Use
Professor Eric Faden of Bucknell University provides this humorous, yet informative, review of copyright principles delivered through the words of the very folks we can thank for nearly endless copyright terms.
***Description taken from website***
Video Found at http://cyberlaw.stanford.edu/documentary-film-program/film/a
by UNCChapelHill 9,537 views
Cory Doctorow talks about his views on copyright, copyright reform, media conglomerates, and the need for change.
by UNCChapelHill 3,187 views
Dr. Cal Lee, Assistant Professor in SILS, presents a lecture on digital curation, the risks of not planning for preservation, digital archaeology, data exercise, emulation versus transformation, the need for standards and a hybrid approach to preservation.
by LibraryOfCongress 936 views
The advent of personal computing devices such as PCs and smart phones with terabyte storage has enable the capture, recording and recall of everything a person reads, writes and hears. These are the new records and artifacts of the 21st century digital person.
Since 1998 Gordon Bell of Microsoft Research has worked on MyLifeBits, a system to digitally store everything in a person's life, including accumulated and current articles, books, correspondence, financial and legal records, memorabilia, photos, telephone calls, time-lapse photos, video and web pages. MyLifeBits is both an experiment in lifetime storage and a software research effort.
For archivists, exponentially increasing amounts of personal digital artifacts will soon arrive seeking immortality at the portals of museums and libraries, providing a new challenge to institutions accustomed to dealing with an analog person's boxes of papers and memorabilia of past millennia. Organizing, retrieving, preserving and protecting these fleeting, bit-based artifacts over the long-term is the contemporary archivist's greatest challenge.
Bell's talk touched on the project's history and discussed the research challenges and the wide-ranging social and personal benefits of the MyLifeBits technology, especially as it pertains to cultural memory institutions.
Speaker Biography: Gordon Bell spent 23 years (1960-1983) at Digital Equipment Corporation as Vice President of Research and Development, where he was responsible for Digital's products. He was the architect of various mini- and time-sharing computers (e.g. the PDP-6) and led the development of DEC's VAX and the VAX Computing Environment. Bell has been involved in, or responsible for, the design of many products at Digital, Encore, Ardent, and a score of other companies. He has been involved in the design of about 30 multiprocessors. He is a founding board member of The Computer History Museum at 1401 Shoreline, Mountain View, CA, established in 1999. The museum's world-class artifact collection came from the former Computer Museum, Boston that he co-founded, 1979 with Gwen Bell that originated in 1975 with the now deceased Digital Equipment Corporation that became part of HP in the lat2 1990s. He became a fellow of the Museum on 22 October 2003.
by rivervalleytv 1,104 views
Collections Librarian, Penrose Library, University of Denver
More videos from the STM Annual Spring Conference 2011 conference are available here:
by simmonscollege 3,259 views
Michael Leach, Head of Collection Development, Cabot Science Library, at Harvard University, and Adjunct Professor, Simmons Graduate School of Information and Library Science
People face a vast array of digital information on the Internet, much of it free. This access has led some to state that "the library is dead" — replaced by Google, Wikipedia, Pub Med and other sites. There is truth to this statement — and some false assumptions. Professor Leach will highlight the major changes already occurring in libraries, due to the explosion of information technology, and discuss how emerging technologies are likely to affect (and not affect!) libraries and information services in the near future.
These are videos from Simmons College in Boston. Simmons is a women-centered, small university located in the heart of Boston. We offer an education that prepares students for leadership positions and career success.
300 THE FENWAY BLOG:
by Mike Browne 6,987 views
Most DSLR cameras can shoot Jpegs, Tiffs and Raw files and these are what are called image file formats. So what are they? Well let me explain.
When you take a photo on a digital camera it shoots a Raw file which is not an image file format at all. It's a data file containing everything needed to create an image file from, kinda like a recipe.
If you're set to shoot jpeg then the camera's processor looks at the data and makes the jpeg file from it and throws the raw file away. The same things happens when you shoot in Tiff mode - it creates the tiff from the raw then deletes the raw.
So what are the benefits? Jpegs are very compressed and small so the are a convenient image file format to use for most things. The downside is that they degrade each time you save them so if you're working in an image editor and save as you work you're actually making the image fall apart.
Tiff files are much more stable than jpegs but they're much bigger file sizes.
If you shoot on Raw mode only then the camera doesn't process the image at all it just keeps it. You then have to download it to your computer and use raw file processing software like Lightroom or Photoshop to make the jpeg image yourself.
Because there's so much more data in a RAW (http://www.photographycourses.biz/image_file_formats.html) you can make a better quality image file because unlike the camera - you have a brain and can make the finished image look the way you want it to. Not the way the camera thinks it should be.
by Jeffrey Pomerantz 62 views
Introduction to data models, precursor to discussion of the data models of digital library applications. For INLS 740