Split input document into paragraphs
- Document should be split on "\n\n", result: n paragraphs per document
- Detect the language of each paragraph
- Apply the correct MLP pipeline according to the detected language of each paragraph
- Put the entire document back together
- Adjust the initial fact spans