For the word-level analysis, four additional dataframes are created for one of the six model combinations: LatinISE, a general language model, paired with ROMTEXT, a legal language model. The words in each dataframe are sorted in descending order of overlap or weighted similarity, so the words at the top are those that feature prominently in both models.
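The sorting step can be sketched with pandas as follows. This is a minimal illustration rather than the code used to build the dataframes: the column names (`word`, `overlap`, `weighted_similarity`), the helper `rank_words`, and all values are hypothetical placeholders, and the way the two scores are computed is not shown here.

```python
import pandas as pd

# Toy data: the words and scores below are invented for illustration only.
# The column names are assumptions, not those of the original dataframes.
scores = pd.DataFrame(
    {
        "word": ["lex", "heres", "actio", "res"],
        "overlap": [3, 7, 5, 7],
        "weighted_similarity": [0.21, 0.64, 0.48, 0.59],
    }
)


def rank_words(df: pd.DataFrame, by: str) -> pd.DataFrame:
    """Return a copy of df sorted in descending order of the column `by`."""
    # mergesort is a stable algorithm, which keeps the result deterministic
    # when several words share the same score.
    ranked = df.sort_values(by=by, ascending=False, kind="mergesort")
    return ranked.reset_index(drop=True)


# One ranking per score; the top rows hold the words shared most strongly
# by the two models under the chosen measure.
by_overlap = rank_words(scores, "overlap")
by_similarity = rank_words(scores, "weighted_similarity")

print(by_overlap.head())
print(by_similarity.head())
```

Keeping one ranking per measure makes it easy to check whether the same words rise to the top under both overlap and weighted similarity.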