python examples / usage for known urls :)
25
README.md
@@ -245,6 +245,31 @@ print(archive_count) # total_archives() returns an int
<sub>Try this out in your browser @ <https://repl.it/@akamhy/WaybackPyTotalArchivesExample></sub>
#### List of URLs that Wayback Machine knows and has archived for a domain name

1) If `alive=True` is set, waybackpy checks every URL to identify the ones that are still alive. Don't use it with popular websites like Google, or it will take too long.
2) To include URLs from subdomains, set `subdomain=True`.

```python
import waybackpy

URL = "akamhy.github.io"
UA = "Mozilla/5.0 (iPad; CPU OS 8_1_1 like Mac OS X) AppleWebKit/600.1.4 (KHTML, like Gecko) Version/8.0 Mobile/12B435 Safari/600.1.4"

known_urls = waybackpy.Url(url=URL, user_agent=UA).known_urls(alive=True, subdomain=False) # alive and subdomain are optional.

print(known_urls) # known_urls() returns a list of URLs
```
```bash
['http://akamhy.github.io',
'https://akamhy.github.io/waybackpy/',
'https://akamhy.github.io/waybackpy/assets/css/style.css?v=a418a4e4641a1dbaad8f3bfbf293fad21a75ff11',
'https://akamhy.github.io/waybackpy/assets/css/style.css?v=f881705d00bf47b5bf0c58808efe29eecba2226c']
```
<sub>Try this out in your browser @ <https://repl.it/@akamhy/WaybackPyKnownURLsToWayBackMachineExample#main.py></sub>
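
The list in the output above contains the same stylesheet twice, differing only in its `?v=` query string. If that is unwanted, the list can be post-processed with the standard library. This is a minimal sketch, assuming `known_urls()` returns plain URL strings as shown; `strip_query_duplicates` is a hypothetical helper and not part of waybackpy.

```python
from urllib.parse import urlsplit, urlunsplit


def strip_query_duplicates(urls):
    """Drop query strings and fragments, keeping the first occurrence of each URL.

    Hypothetical helper for illustration only; not part of waybackpy.
    """
    seen = set()
    unique = []
    for url in urls:
        parts = urlsplit(url)
        # Rebuild the URL from scheme, host and path only.
        bare = urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))
        if bare not in seen:
            seen.add(bare)
            unique.append(bare)
    return unique


# Sample data copied from the example output; in practice this would be
# the list returned by known_urls().
sample = [
    "http://akamhy.github.io",
    "https://akamhy.github.io/waybackpy/",
    "https://akamhy.github.io/waybackpy/assets/css/style.css?v=a418a4e4641a1dbaad8f3bfbf293fad21a75ff11",
    "https://akamhy.github.io/waybackpy/assets/css/style.css?v=f881705d00bf47b5bf0c58808efe29eecba2226c",
]

deduplicated = strip_query_duplicates(sample)
print(deduplicated)
```

The helper preserves the original order, so the first-seen variant of each path decides where it appears in the result.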
### With the Command-line interface
#### Save