NLP model
This doc explains about the S3 bucket structure used for NLP service.
we use
oriserve-dev-nlpbucket to store the NLP files.Below, you can see the bucket and its content (Brands).
These brand names are dynamic, which are subjected to change based on
model trainingsthat mentioned below.
In brand folders, it contains their NLP models.
Inside every brand, there will be three environments -
dev,uat,prodFor every brand, there will be different models are used.
classifier_models: this model is used by most of the brands.MUSE_dual,spellchecker: these models are specific to the brand.
- In the
air-arabiabrand you can find the modelsubtheme_modelsandclassifier_models.
- In the
tata-skybrand, you can find the modelsclassifier_models/andNER_models
- For the
vodafonebrand, you can find the modelsclassifier_models/andMUSE_dual. - To confirm the which models are using for the specific brand, you can get confirmation from the
AI-team.
- In the
When the developers train the model, the respective files will be stored in Dev environment.
DevOps person have to perform data migration from
devtouatorprodaccording on the request/requirement.To do this data migration, a jenkins job has configured.
For example, we got a request for
dt ai model from dev to prodforair-arabiabrand.Log into jenkins.
- Check for view
air-arabiathen go toair-arabia. take below image for reference.

- In the above,image you will find the job
publish_air-arabia_dev_to_prod_ai.- The trained dev model will store in dev folder in s3 bucket (as we seen in starting of doc). This job will publish the dev model to prod.
- After successfully publishing onto the prod, It has a downstream job
prod_air-arabia_nlp, which will deploy the prod model onto the prod AI server.
- After success of this job, we have downstream project
prod_air-arabia_nlp. - Where,
prod_air-arabia_nlpwill deploy the latest model on prod servers.
- After successfully publishing onto the prod, It has a downstream job
- The trained dev model will store in dev folder in s3 bucket (as we seen in starting of doc). This job will publish the dev model to prod.
- Select the respective job and start build.
- After the job getting success, Verify that NLP model are migrated from dev to prod.
- Check for view