Fixes #104
Add build rules, scripts, basic corpus, and dictionary. Currently requires recent clang toolchain.