regex - parsing url parameters

code=yes

http://stackoverflow.com/questions/14679113/getting-all-url-parameters-using-regex

(\?|\&amp;)([^=]+)\=([^&amp;]+)

-----------------------------

#!/usr/bin/perl -wT

my $str = "url.html?id=14114&amp;yolo=hahaha";


while ( $str =~ m/(\?|\&amp;)([^=]+)\=([^&amp;]+)/igs ) {
    print "1 = $1\n";
    print "2 = $2\n";
    print "3 = $3\n";
}



====================================

http://stackoverflow.com/questions/27745/getting-parts-of-a-url-regex



Given the URL (single line):
http://test.example.com/dir/subdir/file.html

How can I extract the following parts using regular expressions:

    The Subdomain (test)
    The Domain (example.com)
    The path without the file (/dir/subdir/)
    The file (file.html)
    The path with the file (/dir/subdir/file.html)
    The URL without the path (http://test.example.com)
    (add any other that you think would be useful)

The regex should work correctly even if I enter the following URL:
http://example.example.com/example/example/example.html

Thank you.


------------------



    A single regex to parse and breakup a full URL including query parameters and anchors e.g.

    https://www.google.com/dir/1/2/search.html?arg=0-a&amp;arg1=1-b&amp;arg3-c#hash

    ^((http[s]?|ftp):\/)?\/?([^:\/\s]+)((\/\w+)*\/)([\w\-\.]+[^#?\s]+)(.*)?(#[\w\-]+)?$

    RexEx positions:

    url: RegExp['$&amp;'],

    protocol:RegExp.$2,

    host:RegExp.$3,

    path:RegExp.$4,

    file:RegExp.$6,

    query:RegExp.$7,

    hash:RegExp.$8

you could then further parse the host ('.' delimited) quite easily.

What I would do is use something like this:

/*
    ^(.*:)//([a-z\-.]+)(:[0-9]+)?(.*)$
*/
proto $1
host $2
port $3
the-rest $4


---------------------



#regex