You are given a task of:
Analyzinga big dataset 2TBs of contractual JSON docs organized by doc_id
Taggingentities (companies) in the documents using entity_id from a collection of companyprofiles
Buildinga search system that in response to the query <company_name> :
Retrievesrelated documents ranked by their relevance
Retrievesrelated companies and their products
Question:What kinds of databases you would use for this task and why?
The attached file gives you more information.