whatwg/html

Escaped characters in source

Open

#3,683 创建于 2018年5月14日

在 GitHub 查看
 (4 评论) (0 反应) (0 负责人)HTML (7,654 star) (2,520 fork)batch import
good first issue

描述

The current source file has a large number of encoded entities. This makes it rather hard to edit and read. As UTF-8 is everywhere, is it time to replace these with their Unicode representation?

For example:

  <li value="9"><cite lang="sh">Црна мачка, бели мачор</cite>, 1998</li>

Becomes:

  <li value="9"><cite lang="sh">Црна мачка, бели мачор</cite>, 1998</li>

And

<p w-nodev>In an algorithm, steps in <span data-x="synchronous section">synchronous
  sections</span> are marked with &#x231B;.</p>

Could be changed to:

<p w-nodev>In an algorithm, steps in <span data-x="synchronous section">synchronous
  sections</span> are marked with ⌛.</p>

There is one obvious exception - invisible / non-printing characters.

Would you be interested in a pull request to transform all the &#x... references to decoded equivalent?

This builds upon the HTML5.3 work done in https://github.com/w3c/html/pull/1280

贡献者指南