Daniel Gultsch

2 months ago

For the next #Conversations_im release I’m refactoring how URIs are linked / made clickable. I’m adding a bunch of URI schemes like tel and mailto on top of the existing xmpp, http(s) and geo but removing support for "things that look like web URLs but aren’t actually URIs" (like 'example.com') to avoid some false positives.

Once the 2.18.0-beta comes out tomorrow or so, let me know if you see things that aren't matched but should be, or vice versa.
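Editor's sketch of the scheme-based matching described above: only text that starts with a known URI scheme is linked, so a bare 'example.com' is ignored. This is not the Conversations implementation. The allowlist is simply the set of schemes named in the post, and `find_uris` is a hypothetical helper. Trailing punctuation is deliberately left alone here.

```python
import re

# Assumed allowlist: the schemes mentioned in the post. The real app may use more.
ALLOWED_SCHEMES = ("xmpp", "http", "https", "geo", "tel", "mailto")

# A known scheme, a colon, then a run of characters that are not
# whitespace, angle brackets or double quotes.
URI_PATTERN = re.compile(
    r"\b(?:" + "|".join(ALLOWED_SCHEMES) + r"):[^\s<>\"]+",
    re.IGNORECASE,
)


def find_uris(text):
    """Return every substring of `text` that starts with an allowed scheme."""
    return URI_PATTERN.findall(text)


print(find_uris("see example.com or https://example.com/a and mailto:me@example.org"))
# → ['https://example.com/a', 'mailto:me@example.org']
```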

Guus der Kinderen

in reply to Daniel Gultsch 2 months ago

Don't forget to pay attention to the _end_ of a URI please! It's super annoying when clicks end up going 404 because the clicked value contained a trailing comma, full stop, parenthesis or something (e.g. example.org/somepage) <-- shouldn't include the trailing `)` #petpeeve

mistersixt

in reply to Daniel Gultsch 2 months ago

I would like to tell you about a request that I have already received several times from friends and family members: you can search for terms in the chat history. The search itself works fine, but you can't jump from a result to the actual place in the chat history. But that would be important because the messages before and/or after the search term are often just as important. Currently, you have to remember the date, switch to the chat history and then scroll (possibly indefinitely).

Ténno Seremél’

in reply to Daniel Gultsch 2 months ago

What about Gemini protocol links (gemini://...)? I haven’t checked yet whether they already work, though…

Guus der Kinderen

Unknown parent 2 months ago

I wouldn't know how to do that either, to be honest. I'm guessing this is a wheel that has already been invented in some reusable code, but maybe not for your platform. Perhaps consider dropping certain characters that are technically valid if they appear as the last character of the URI?
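Editor's sketch of that suggestion: once a candidate URI has been matched, drop punctuation from its end. The character set and the name `trim_trailing_punctuation` are assumptions made for illustration, not taken from any existing library.

```python
# Assumed set of characters that are valid in a URI but rarely intended
# as its final character when the URI sits inside a sentence.
TRAILING_PUNCTUATION = ".,;:!?)'\""


def trim_trailing_punctuation(candidate):
    """Strip every trailing character found in TRAILING_PUNCTUATION."""
    return candidate.rstrip(TRAILING_PUNCTUATION)


print(trim_trailing_punctuation("https://example.org/somepage)."))
# → https://example.org/somepage
```

The trade-off is that a URI which legitimately ends in one of these characters gets shortened as well.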

Buntbart

in reply to Daniel Gultsch 2 months ago

what about the upcoming #Taler-URLs?
docs.taler.net/taler-merchant-…

4. Merchant API Tutorial — GNU Taler

allo

Unknown parent 2 months ago

Maybe like this?
.*[^.)]$

(Replace .* with the actual regex)

And I would vastly prefer a URI ending in ")" being a false negative over many URIs in parentheses being false positives.
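Editor's sketch of the pattern above used for searching inside a message. `URI_BODY` is a stand-in for "the actual regex" and is an assumption. The final character class also excludes whitespace, which the original `[^.)]$` does not need because it is anchored to the end of the string.

```python
import re

# Stand-in for "the actual regex": an http(s) scheme plus non-whitespace characters.
URI_BODY = r"https?://\S*"

# The last matched character must not be whitespace, '.' or ')'.
# The greedy \S* backtracks until that condition holds.
PATTERN = re.compile(URI_BODY + r"[^\s.)]")


def first_uri(text):
    """Return the first match in `text`, or None if there is none."""
    match = PATTERN.search(text)
    return match.group(0) if match else None


print(first_uri("(see https://example.org/somepage)."))
# → https://example.org/somepage
print(first_uri("https://en.wikipedia.org/wiki/Foo_(bar)"))
# → https://en.wikipedia.org/wiki/Foo_(bar
```

The second call shows the accepted false negative: a URI that really ends in ")" loses its last character.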

@guusdk

allo

Unknown parent 2 months ago

Right ... I guess there we're at the edge cases.

If I were looking for a heuristic, I would say that if the URI has ( or ) before the end, then it can have one at the end, otherwise one can assume that it does not belong to the URL. But I can see how you can find more and more edge cases in such heuristics.
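Editor's sketch of this heuristic, taken literally: a trailing ")" is kept only if another "(" or ")" appears earlier in the candidate. The function name `trim_closing_paren` is hypothetical.

```python
def trim_closing_paren(candidate):
    """Drop a trailing ')' unless the candidate contains an earlier parenthesis."""
    if not candidate.endswith(")"):
        return candidate
    body = candidate[:-1]
    if "(" in body or ")" in body:
        # An earlier parenthesis suggests the final one belongs to the URI.
        return candidate
    return body


print(trim_closing_paren("https://example.org/somepage)"))
# → https://example.org/somepage
print(trim_closing_paren("https://en.wikipedia.org/wiki/Foo_(bar)"))
# → https://en.wikipedia.org/wiki/Foo_(bar)
```

One of the edge cases mentioned: a URI with its own balanced pair that is itself wrapped in parentheses, such as "https://example.org/a_(b))", keeps the extra ")".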

@rakoo @guusdk

allo

Unknown parent 2 months ago

The computer scientist in me says you can't find matching parentheses with RegEx.

The programmer in me says look for a solution that is "good enough". An idea that would cover a lot of cases would be, for example, to assume that a URI can contain at most one pair of parentheses. Counterexamples are rare, and they become the occasional false negatives one has to accept.
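Editor's sketch of the "at most one pair" idea as a regular expression. With the nesting depth capped at one, no counting is needed, so a plain regex is enough. The http(s)-only prefix and the name `ONE_PAIR` are assumptions. Trailing full stops and commas are not handled here.

```python
import re

# Characters outside parentheses, then optionally one "(...)" group,
# then more characters outside parentheses.
ONE_PAIR = re.compile(r"https?://[^\s()]*(?:\([^\s()]*\)[^\s()]*)?")


def first_uri(text):
    """Return the first match in `text`, or None if there is none."""
    match = ONE_PAIR.search(text)
    return match.group(0) if match else None


print(first_uri("(see https://example.org/somepage)"))
# → https://example.org/somepage
print(first_uri("(see https://en.wikipedia.org/wiki/Foo_(bar))"))
# → https://en.wikipedia.org/wiki/Foo_(bar)
print(first_uri("https://example.org/a_(b)_(c)"))
# → https://example.org/a_(b)_
```

The last call is the accepted false negative: a URI with a second pair of parentheses is cut off before it.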

@rakoo @guusdk

rakoo

in reply to allo 2 months ago

seconded, I don't even think I've ever seen a legit url ending with a ')'
@daniel @guusdk
