Stream: zulip

Topic: Zulip archive charset


Nemo157 (Aug 30 2020 at 18:50):

It appears the zulip archive does not specify that it's using UTF-8, resulting in broken rendering for pages like https://zulip-archive.rust-lang.org/233931tcompilermajorchanges/42684RequireuserstoconfirmtheyknowRUSTCcompilerteam350.html that include non-ASCII characters.

Joshua Nelson (Aug 30 2020 at 18:55):

that stream doesn't have non-ascii :/ the weird encoding is ...

Joshua Nelson (Aug 30 2020 at 18:55):

unless triagebot posted an ellipsis character instead of three dots?

Nemo157 (Aug 30 2020 at 19:27):

yes, it's a unicode ellipsis, and sent to the browser in UTF-8 encoding

Nemo157 (Aug 30 2020 at 19:27):

which at least for me renders as â€¦ because the web still doesn't default to UTF-8

Nemo157 (Aug 30 2020 at 19:28):

(and near the bottom is ðŸ¥º which I assume is some emoji :grinning:)
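
The garbling described in these messages is what appears when UTF-8 bytes are decoded with a legacy single-byte encoding. A minimal Python sketch, assuming the browser fell back to Windows-1252 (the thread does not say which encoding was actually used, and `misdecode` is a hypothetical helper name):

```python
def misdecode(text, assumed_encoding="cp1252"):
    """Show how UTF-8 text looks when decoded with the wrong encoding.

    assumed_encoding is an assumption: the fallback a browser picks when a
    page declares no charset depends on the browser and locale.
    """
    return text.encode("utf-8").decode(assumed_encoding)


# U+2026 HORIZONTAL ELLIPSIS is three bytes in UTF-8 (E2 80 A6),
# so it turns into three unrelated characters.
print(misdecode("\u2026"))      # â€¦

# U+1F97A (an emoji) is four bytes in UTF-8 (F0 9F A5 BA).
print(misdecode("\U0001F97A"))  # ðŸ¥º
```

Plain ASCII text is unaffected because its bytes are identical in both encodings, which is why only the ellipsis and the emoji on the page look wrong.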

bjorn3 (Aug 30 2020 at 19:28):

<meta charset="utf-8">
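
One way to confirm whether a generated page carries this tag is to scan it for a `<meta charset>` declaration. The following is a rough sketch using only Python's standard `html.parser`; `CharsetFinder` and `declared_charset` are hypothetical names, and the check ignores any charset sent in the HTTP `Content-Type` header:

```python
from html.parser import HTMLParser


class CharsetFinder(HTMLParser):
    """Record the first charset declared via <meta charset="...">."""

    def __init__(self):
        super().__init__()
        self.charset = None

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag == "meta" and self.charset is None:
            value = dict(attrs).get("charset")
            if value:
                self.charset = value.lower()


def declared_charset(html_text):
    """Return the declared charset in lowercase, or None if there is none."""
    finder = CharsetFinder()
    finder.feed(html_text)
    return finder.charset


print(declared_charset('<html><head><meta charset="utf-8"></head></html>'))
print(declared_charset("<html><head><title>no charset</title></head></html>"))
```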

Nemo157 (Aug 30 2020 at 19:29):

yep, I could even provide a PR to add that if anyone knows where the zulip archive is generated from?

davidtwco (Aug 30 2020 at 19:29):

https://github.com/rust-lang/zulip_archive

bjorn3 (Aug 30 2020 at 19:30):

Probably https://github.com/zulip/zulip-archive

simulacrum (Aug 30 2020 at 19:30):

https://github.com/rust-lang/zulip_archive

davidtwco (Aug 30 2020 at 19:30):

I think we've found it

simulacrum (Aug 30 2020 at 19:30):

we have our own fork, I can deploy it when needed

simulacrum (Aug 30 2020 at 19:30):

(and r? me on PRs)

Nemo157 (Aug 30 2020 at 19:34):

hmm, so that appears to generate markdown files, but then how are those rendered to html?

simulacrum (Aug 30 2020 at 23:05):

We run them through pulldown-cmark (potentially a bit outdated, we can update if needed)
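
Since the Markdown renderer only produces an HTML fragment, the charset declaration would have to come from whatever wraps that fragment into a full page. A minimal sketch of such a wrapper, in Python purely for illustration; the template, `wrap_fragment`, and its parameters are assumptions and do not reflect how the rust-lang fork actually builds its pages:

```python
import html

# Hypothetical page template. The charset declaration sits first in <head>
# so that it appears early in the document, before any non-ASCII text.
PAGE_TEMPLATE = """<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8">
<title>{title}</title>
</head>
<body>
{body}
</body>
</html>
"""


def wrap_fragment(body_html, title):
    """Wrap an already-rendered HTML fragment in a page that declares UTF-8."""
    return PAGE_TEMPLATE.format(title=html.escape(title), body=body_html)


page = wrap_fragment("<p>Waiting on the team\u2026</p>", "Zulip archive charset")
# When saving, the file must really be written as UTF-8 to match the tag,
# e.g. pathlib.Path("topic.html").write_text(page, encoding="utf-8").
print(page)
```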


Last updated: Oct 11 2021 at 22:34 UTC