'Extract part of url with dynamic form

I need to extract part of strings that represent urls.Its from an api response and I need to get specific part(its called ASIN)

Examples

For me to extract the ASIN number, its after the /dp and before /ref part of the url string

 print(f"asin {url.split('/')[-2]}")

prints B091JJZPCM,B07P3CTC3Z

But often times, some urls has different pattern, like

https://www.amazon.com/gp/slredirect/picassoRedirect.html/ref=pa_sp_btf_aps_sr_pg1_1?ie=UTF8&adId=A10365631KNJTYYLZ8RME&qualifier=1651036806&id=2996869036517091&widgetName=sp_btf&url=%2FWustentre-Minimalist-Leather-Blocking-Tracker%2Fdp%2FB09NHTNTH8%2Fref%3Dsr_1_59_sspa%3Fkeywords%3Dmens%2Bwallet%26qid%3D1651036806%26sr%3D8-59-spons%26psc%3D1

In this case, how do you extract the ASIN part?

python-3.x

Solution 1:^[1]

Solved it by

        url = "https://amazon.com/**********************"
        asin = url.split("/")[-2]
        if "html" in asin:
            print(f"url {asin}")
            parsed_url = urlparse(url)
            captured_value = parse_qs(parsed_url.query)["url"][0]
            url = f"https://amazon.com{captured_value}"
            print(f"captured url {url}")
            asin = url.split("/")[-2]
            print(f"captured asin {asin}")

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

Solution	Source
Solution 1	ira

'Extract part of url with dynamic form

Solution 1:[1]

Sources

Related Questions

Solution 1:^[1]