Inference pipeline to deploy IndicTrans NMT models on top of Flask server.
- Python 3.6 +
- Ubuntu 16.04 +
Install various python libraries as mentioned in requirements.txt file
pip install -r src/requirements.txt- Put all the models in
src/nmt_modelsfolder as per the specs inconfig/fetch_models.json - Run
app.pyto start the service with all the packages installed
python src/app.pyIndicTrans-v1 suports 11 major Indic languages:
| Assamese (as) | Hindi (hi) | Marathi (mr) | Tamil (ta) |
| Bangla (bn) | Kannada (kn) | Oriya (or) | Telugu (te) |
| Gujarati (gu) | Malayalam (ml) | Panjabi (pa) |
IndicTrans-v2 supports all 22 scheduled langauges of India, which includes English, 20 Indic languages (4 Dravidian, 15 Indo-Aryan, 1 Munda) and 2 Tibeto-Burman languages (Bodo & Manipuri).
| ISO 639 code | Language |
|---|---|
| as | Assamese - অসমীয়া |
| bn | Bangla - বাংলা |
| brx | Boro - बड़ो |
| doi | Dogri - डोगरी |
| gom | Goan-Konkani - कोंकणी |
| gu | Gujarati - ગુજરાતી |
| hi | Hindi - हिंदी |
| kn | Kannada - ಕನ್ನಡ |
| ks | Kashmiri - كٲشُر |
| ks_Deva | Kashmiri - कॉशुर |
| gom | Konkani Goan - कोंकणी |
| mai | Maithili - मैथिली |
| ml | Malayalam - മലയാളം |
| mni | Manipuri - ꯃꯤꯇꯩꯂꯣꯟ |
| mni_Beng | Manipuri - মিতৈলোন |
| mr | Marathi - मराठी |
| ne | Nepali - नेपाली |
| or | Oriya - ଓଡ଼ିଆ |
| pa | Panjabi - ਪੰਜਾਬੀ |
| sa | Sanskrit - संस्कृतम् |
| sat | Santali - ᱥᱟᱱᱛᱟᱲᱤ |
| sd | Sindhi - سنڌي |
| sd_Deva | Sindhi - सिंधी |
| ta | Tamil - தமிழ் |
| te | Telugu - తెలుగు |
| ur | Urdu - اُردُو |
https://github.com/AI4Bharat/indicTrans
The indictrans inference service code (and models) are released under the MIT License.