The data and models will be useful for captioning local-language media; voice assistants for agriculture and health; ...