Split input document into paragraphs

Split the document on "\n\n"; the result is n paragraphs per document
Detect the language of each paragraph
Apply the correct MLP pipeline according to the detected language of each paragraph
Put the entire document back together
Adjust the initial fact spans
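
The steps above can be sketched roughly as follows. This is a minimal illustration, not the actual MLP API: `process_document`, `keyword_pipeline`, `toy_detect_language`, and the `(label, start, end)` fact format are all hypothetical stand-ins. It also assumes that each pipeline leaves the paragraph text unchanged and that "adjusting" a fact span means shifting a paragraph-relative span by that paragraph's offset in the reassembled document.

```python
from typing import Callable, Dict, List, Tuple

SEPARATOR = "\n\n"

# Hypothetical fact format: (label, start, end), offsets relative to the analysed text.
Fact = Tuple[str, int, int]
Pipeline = Callable[[str], List[Fact]]


def process_document(
    text: str,
    detect_language: Callable[[str], str],
    pipelines: Dict[str, Pipeline],
    default_lang: str = "en",
) -> Tuple[str, List[Fact]]:
    """Split, detect language, run the matching pipeline, rejoin, shift spans."""
    paragraphs = text.split(SEPARATOR)
    facts: List[Fact] = []
    offset = 0  # position of the current paragraph in the full document
    for paragraph in paragraphs:
        lang = detect_language(paragraph)
        # Fall back to the default pipeline for unsupported languages.
        pipeline = pipelines.get(lang, pipelines[default_lang])
        for label, start, end in pipeline(paragraph):
            # Spans are paragraph-relative; shift them to document offsets.
            facts.append((label, start + offset, end + offset))
        offset += len(paragraph) + len(SEPARATOR)
    return SEPARATOR.join(paragraphs), facts


def keyword_pipeline(label: str, keyword: str) -> Pipeline:
    """Toy stand-in for a language-specific pipeline: tags one keyword."""
    def run(paragraph: str) -> List[Fact]:
        found: List[Fact] = []
        start = paragraph.find(keyword)
        while start != -1:
            found.append((label, start, start + len(keyword)))
            start = paragraph.find(keyword, start + len(keyword))
        return found
    return run


def toy_detect_language(paragraph: str) -> str:
    """Toy stand-in for a real language detector."""
    return "et" if any(ch in paragraph for ch in "äõöü") else "en"


if __name__ == "__main__":
    pipelines = {
        "en": keyword_pipeline("LOC", "London"),
        "et": keyword_pipeline("LOC", "Tallinn"),
    }
    doc = "I live in London.\n\nMa elan Tallinnas, see on väga ilus."
    rebuilt, facts = process_document(doc, toy_detect_language, pipelines)
    for label, start, end in facts:
        print(label, repr(rebuilt[start:end]))
```

Tracking a running offset while iterating keeps the span adjustment to one addition per fact. If a real pipeline rewrites the paragraph text, the offset would have to be computed from the processed paragraphs instead.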