Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support possible German umlauts in email address strings #88

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

rstammer
Copy link

In German language, and of course in many others, there are special characters, i.e. the so-called umlauts in case of German.

Under some circumstances these characters are supported by modern SMTP servers, see i.e. this discussion. (this is also an interesting related read)

We are also experiencing this within our data, but unfortunately rinkus auto_link mechanics fail to handle email addresses for mailto-links correctly, if the email address contains an umlaut.

Current behavior

Rinku.auto_link("björn-jürgen.nuß[email protected]") 
# björn-jürgen.nuß<a href="mailto:[email protected]">[email protected]</a>

Expected behavior

Rinku.auto_link("björn-jürgen.nuß[email protected]") 
# <a href="mailto:björn-jürgen.nuß[email protected]">björn-jürgen.nuß[email protected]</a>

Discussion

Our PR is restricted only on the case of German umlauts, but of course we see that a wider range of characters are affected for other languages, i.e. ñ or cases like é. We were not 100% certain how to proceed for this and would be happy for some advice. Is the approach we've taken in the code, namely extending the lookup for special characters, the right one here?

Maybe we're on the wrong track and the support for those characters should get added to rinkus rinku_isalnum() function. What do you think?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant