Wrong encoding in url_parse #102

strohne · 2019-12-08T14:16:03Z

Everybody loves encoding issues ;) When parsing URLs that contain non-ASCII characters, the encoding of the domain gets messed up, and I have not found a way to fix it yet. Steps to reproduce:

# Create UTF-8 string
url <- "https://exämple.org"

# Conversion is necessary in my RStudio environment
url <- iconv(url, "latin1", "UTF-8")
Encoding(url)  # UTF-8
print(url)     # https://exämple.org

# Parse
url_parse(url)

Output for the domain part is ex<e3><U+00A4>mple.org. Expected: exämple.org.
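
To narrow down where the encoding goes wrong, it may help to look at the raw bytes before parsing. The following is a minimal base-R sketch, not part of the original report. It builds the URL from a Unicode escape so that it does not depend on the console or source-file encoding. The `misread` variable is a hypothetical name that only simulates UTF-8 bytes being treated as latin1; whether that is what happens inside url_parse is an assumption.

```r
# Build the string from a Unicode escape: R marks such strings as UTF-8.
url <- "https://ex\u00e4mple.org"
print(Encoding(url))

# In UTF-8, the character U+00E4 is stored as the two bytes c3 a4.
bytes <- charToRaw(url)
print(bytes)

# Simulation (assumption, not confirmed for url_parse): re-declare the same
# bytes as latin1, then convert to UTF-8. Each of the two bytes is now
# treated as a separate character, which produces mojibake.
misread <- url
Encoding(misread) <- "latin1"
misread <- enc2utf8(misread)

# The misread string is one character longer than the original.
print(nchar(url, type = "chars"))
print(nchar(misread, type = "chars"))
```

If the domain returned by url_parse shows the same kind of extra character, the bytes were probably reinterpreted somewhere along the way rather than lost, so comparing `charToRaw()` on the input and on the parsed domain should show at which step they change.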


EmilBode mentioned this issue Sep 29, 2020

Encoding of url_decode #108

Closed
