Link Extractor

Pull every URL out of a wall of text or a chunk of HTML. Remove duplicates, sort, filter by domain, and export as a list, CSV, Markdown, or JSON. It all runs in your browser — no signup, no upload, your text never leaves your device.

Text — scan prose for URLs HTML — parse markup for href/src links

Paste text or HTML

Options

Remove duplicates Sort Filter by domain or keyword Show relative links (HTML) Resolve relative links against base URL (HTML)

0 found · 0 unique · 0 duplicates removed · 0 domains

Extracted links

🔒 Parsed in your browser. Nothing is uploaded, and links are never fetched or visited.

What is the Link Extractor?

A link extractor pulls every URL out of a blob of text or HTML so you do not have to hunt for them by hand. Paste a newsletter, a chat log, a page of copied source code, or a messy export, and the link extractor hands back a clean list of every web address it can find. It works two ways: a text mode that scans ordinary prose for URL patterns, and an HTML mode that actually parses the markup and pulls links out of the attributes browsers use. You stay in control with options to remove duplicates, sort the list, and filter down to a single domain.

Text mode vs HTML mode

This is the part that sets the tool apart. Text mode treats your input as plain prose and scans it with a practical URL pattern, catching http, https, ftp, and bare www. addresses wherever they appear in the writing. That is perfect for newsletters, emails, documents, and chat transcripts where links sit inside sentences.

HTML mode is different and more precise. It runs your input through the browser's own HTML parser and reads links straight out of href, src, action, and poster attributes. Those are exactly the links a plain text scan would miss, because in real markup the address lives inside a tag rather than in readable text. HTML mode also surfaces relative links such as /about or img.png, and it can resolve them against a base URL you supply so they come back as full absolute links. When you have pasted page source, HTML mode is the right choice.

How to use it

Pick a source mode: Text for prose, HTML for markup.
Paste your text or HTML into the box.
The links appear instantly, with a live count of total, unique, duplicates removed, and unique domains.
Adjust the options if you want: remove duplicates, sort, or filter to one domain.
Choose an output format and copy all or download, then close the tab.

Cleaning the results

Raw extraction often leaves a noisy list, so the tool gives you simple controls. Duplicate removal is on by default and the stats show how many repeats were dropped. Sorting puts the list in alphabetical order or groups it by hostname so links from the same site land together. The domain filter keeps only the links whose address contains the text you type, which is the fastest way to isolate one site or strip out a tracker.

Export formats

When you are done you can export the links four ways. A plain newline list is best for pasting into a sheet or another tool. Comma-separated output drops cleanly into a spreadsheet cell or a config value. Markdown turns each link into a ready-to-paste Markdown link for notes and documentation. JSON gives you a string array you can drop straight into code.

Common use cases

SEO audits — checking the outbound links on a page before or after a change.
Scraping research — harvesting every URL from a copied or scraped blob.
Notes and docs — turning a wall of bare links into a tidy Markdown list.
Citations — collecting reference links out of a long document.
Domain inventory — seeing which domains a piece of content points to.

Why use this one

Most free extractors are a single regex box that dumps whatever it matches, with no dedupe, no sort, and no real HTML parsing. Others lock the useful stats and exports behind a premium upgrade. This tool gives you both source modes on one page, real processing controls, and four export formats with nothing gated. It also keeps a promise the crawler-style competitors cannot: everything runs in your browser. The tool never fetches, resolves, or visits any link in your input — it only parses text locally, so your content never leaves your device and there is nothing to upload. If you also need to tidy the surrounding text, the Text Formatter collapses extra spaces, the Find and Replace tool can rewrite every link at once, Remove Line Breaks flattens the list onto one line, and the Case Converter fixes capitalization on the same pasted text.

Frequently asked questions

What is the difference between Text mode and HTML mode?

Text mode scans plain text for URL patterns, which is ideal for newsletters, chat logs, and documents. HTML mode parses the markup and pulls links out of href, src, action, and poster attributes, and it can also handle relative links. Use HTML mode whenever you have pasted page source.

Does it extract links from href attributes in HTML?

Yes. That is exactly what HTML mode is for. It reads href, src, action, and poster attributes using the browser's own HTML parser, so it captures links that a plain text scan would structurally miss.

Can it remove duplicate links?

Yes. Duplicate removal is on by default, and the live stats show how many duplicates were removed. Turn it off if you want to keep every occurrence in the original order.

Can I get only the links from one domain?

Yes. Type a domain or keyword in the filter box and the results are narrowed to links whose host or address contains that text, so you can isolate a single site or path.

How do I export the links as Markdown?

Choose the Markdown output format, then copy or download. Each link comes out as a Markdown link ready to paste straight into notes or documentation. CSV and JSON formats are available too.

Is my text uploaded anywhere?

No. All extraction happens locally in your browser. Nothing is sent to a server, and the tool never fetches or visits the links — it only lists them, so there is no network round-trip at all.

Does it find relative links like /about or img.png?

In HTML mode, yes. Relative links are shown by default, and you can optionally resolve them against a base URL so they come out as full absolute links.