Wrong encoding in url_parse #102

Open
strohne opened this issue Dec 8, 2019 · 0 comments
strohne commented Dec 8, 2019

Everybody loves encoding issues ;) When parsing URLs that contain non-ASCII characters, the encoding of the domain gets garbled, and I have not found a way to fix it yet. Here is how to reproduce it:

# Assumption: url_parse() comes from the urltools package
library(urltools)

# Create UTF-8 string
url <- "https://exämple.org"

# Conversion from latin1 is necessary in my RStudio environment
url <- iconv(url, "latin1", "UTF-8")
Encoding(url)  # "UTF-8"
print(url)     # https://exämple.org

# Parse
url_parse(url)

Output for the domain part is ex<e3><U+00A4>mple.org. Expected: exämple.org.
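
To help narrow this down, here is a minimal base R sketch that inspects the bytes involved. It only illustrates one possible reading of the garbled output and does not confirm where the problem actually occurs: the UTF-8 bytes of "ä" are `c3 a4`, and if those two bytes are treated as two Latin-1 characters, the second one is U+00A4, which matches the `<U+00A4>` in the output above. The variable names are made up for this illustration.

```r
# Write "ä" as a Unicode escape so the string is UTF-8 regardless of editor or locale.
a_umlaut <- "\u00e4"

# The UTF-8 encoding of "ä" is the two bytes c3 a4.
bytes <- charToRaw(a_umlaut)
print(bytes)

# Assumption for illustration only: reinterpret those two bytes as Latin-1,
# i.e. as two separate one-byte characters.
misread <- rawToChar(bytes)
Encoding(misread) <- "latin1"

# Convert the misread string to UTF-8 and list its code points.
# The second code point is U+00A4, as seen in the garbled domain.
code_points <- utf8ToInt(enc2utf8(misread))
print(sprintf("U+%04X", code_points))
```

If the same bytes show up when running `charToRaw()` on the domain returned by `url_parse()`, that would suggest the string is being handled byte by byte somewhere during parsing rather than being wrongly encoded on input.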
