codex-team/editor.js

Pasting from MS Word places random new line characters in the paragraph text

Open

#2,350 opened on May 4, 2023

View on GitHub
 (5 comments) (0 reactions) (0 assignees)TypeScript (26,602 stars) (1,985 forks)batch import
buggood first issue

Description

Describe a bug.

When pasting from MS Word into the editor, random new line characters (\n) get placed into the resultant JSON [within the paragraph text itself].

Steps to reproduce:

  1. Open the editor
  2. Copy from Word and paste.
  3. Save
{
  "id": "1LRfwKqMBi",
  "type": "paragraph",
  "data": {
      "text": "For three days it rained and everything he carried except\nfor..."
  }
},

These line breaks are not in the Word doc. I looked at the xml inside the .docx. Also, if you type everything into the editor, it doesn't put random line breaks.

Expected behavior:

No extra line breaks.

Why does this matter?

My content management system has an option to strip all line breaks before delivering the html to the browser to cut down on final size.

<html><head>...</head><body>...</body></html>

I can't activate that option because then the paragraphs with random line breaks have words mashed together.

Editor.js version:

v2.26.5

Plugins you use with their versions:

No plugins

Contributor guide

Pasting from MS Word places random new line characters in the paragraph text · codex-team/editor.js#2350 | Good First Issue