Vocabulary Transfer and Knowledge Distillation for Language Model Compression