Google has over 100 million lines of code, and our biggest programming language is C++. We have a single, shared codebase developed primarily on mainline. We build every binary and all of its libraries from scratch every time, allowing us to incrementally evolve APIs and libraries over time. The entire development process is extremely incremental in nature, and even API-breaking changes are a regular occurrence. However, for core libraries used throughout the codebase, this development model is a huge challenge: how do we incrementally evolve an API in use by tens of thousands of other libraries? The answer is to use Clang to automatically refactor APIs and their users across the codebase. How do we scale Clang up to possibly the single largest unified codebase in the world? The same way Google scales anything else: MapReduce. By coupling Clang's library design and architecture to existing Google infrastructure we can automatically compile, analyze, and refactor the entire Google codebase in minutes. In this talk, I will dive into the challenges of refactoring C++ code, how we're using Clang and making it even better at solving them, and how we scale these solutions to the size of our codebase.
Is there any place that we can watch for when this tool is released?
otherwiseguy 3 weeks ago
Really interesting :)
PixelAliensWorld 2 months ago
JuliaLLawall: we actually looked at Coccinelle before implementing the language, and found that it was not sufficient nor easy enough to extend for what we wanted to do in C++. Unfortunately C++ is a completely different beast from C - there are a lot more corner cases and little hidden things that are really important to match on.
manuelklimek 2 months ago
It looks similar to what Coccinelle does, although Coccinelle provides pattern matching on the concrete syntax and allows the transformation specification to be integrated directly into the pattern specification, using a patch-like notation. It doesn't seem possible to provide an example on the youtube comment page, but there are examples on the web site. Currently, however, Coccinelle only supports C code, with a very minimal treatment of C++.
JuliaLLawall 2 months ago
Really good talk!
cardologist 2 months ago