bup is short for "backup". bup uses the file format of the distributed version control system Git. It solves Git's problems with big files. Deduplication is used to make backups space efficent (about five times smaller than rsnapshot's backups). Data is deduplicated globally across files and backups. If a small part of a big file is changed only little additional space is needed.
The major part of this talk will describe Git's concepts, the structure of a repository, file format, and go into detail about the resulting implications on backups.
After a demonstration of bup I'll describe the implemented algorithms and data structures and their resulting perfomance gains over other backup solutions.
The talk will end with an overview of the recent development and a bait for new developers.