A multilingual corpus database for typological and genetic linguistics