How is the language of documents determined?