Monday, 21 April 2014

Regular expression help - what's wrong?


Vote count:

1




I would like to ask for help with my regex. I need to extract the very last part from each URL. I marked it as 'to_extract' within the example below.


I want to know what's wrong with the following regex when used with sed:



sed 's/^[ht|f]tp.*\///' file.txt


Sample content of file.txt:



http://a/b/c/to_extract
ftp://a/b/c/to_extract
...


I am getting correct results only for the ftp links, not for the http ones. Thanks in advance for your explanation.



asked 2 mins ago

ivan

21

1 Answer



Vote count:

3




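Change [ht|f] to (ht|f); that would give better results. Note that sed uses basic regular expressions by default, where ( ) and | are literal characters, so either run sed with -E (-r on older GNU sed) or, with GNU sed, escape them as \(ht\|f\).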


[abc] means "one character which is a, b or c".


[ht|f] means "one character which is h, t, | or f", not at all what you want.
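
This also explains the symptom in the question: "ftp" matches because f is one of the four bracket characters and is followed by "tp", while "http" would need a single bracket character followed by "tp", and it has "ttp" after the h. A minimal sketch of the difference, assuming a sed that accepts -E for extended regular expressions (GNU and BSD sed do). The temporary file stands in for file.txt, and the variable names are illustrative only:

```shell
# Recreate the sample input from the question in a temporary file
# (a stand-in for file.txt; the names here are illustrative).
sample=$(mktemp)
printf '%s\n' 'http://a/b/c/to_extract' 'ftp://a/b/c/to_extract' > "$sample"

# Original attempt: the bracket expression matches exactly ONE character,
# so only "ftp" satisfies "[ht|f]tp"; the http line is left unchanged.
broken=$(sed 's/^[ht|f]tp.*\///' "$sample")

# Fix: real alternation between "ht" and "f", which requires
# extended regular expression syntax (-E).
fixed=$(sed -E 's/^(ht|f)tp.*\///' "$sample")

printf 'broken:\n%s\n\nfixed:\n%s\n' "$broken" "$fixed"
rm -f "$sample"
```

Because .* is greedy, the match runs up to the last slash on the line, so only the final path component is left after the substitution.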



answered 2 mins ago

dystroy

127k



