cd8a32ed1f
added tests for cdx_snapshot.py at tests/test_cdx_snapshot.py
Akash Mahanty
2022-01-24 16:29:44 +05:30
57512c65ff
change the test for the oldest method from google.com to example.com; the oldest archive on google.com is, for some unknown reason, not very stable.
Akash Mahanty
2022-01-24 16:27:35 +05:30
2bea92b348
fix bug with the third matching case of the archive_url_parser, caught while writing more tests for the save API interface.
Akash Mahanty
2022-01-24 13:31:30 +05:30
d506685f68
added some tests for save_api interface
Akash Mahanty
2022-01-23 18:35:54 +05:30
7844d15d99
close the session in save api interface
Akash Mahanty
2022-01-23 18:34:06 +05:30
c0252edff2
updated tests for availability_api.py and also added max_tries (default value is 3) with a delay (sleep) between successive API calls. The delay actually improves the performance of the availability_api interface.
Akash Mahanty
2022-01-23 15:05:10 +05:30
e7488f3a3e
added test badge, renamed test from ubuntu to Tests, and fixed the Incomplete URL substring sanitization (or tried to)
Akash Mahanty
2022-01-23 02:26:53 +05:30
aed75ad1db
Make modules importable as part of a Python package, waybackpy, by creating an __init__.py file in tests
Akash Mahanty
2022-01-23 02:14:38 +05:30
d740959c34
more dev reqs
Akash Mahanty
2022-01-23 02:10:12 +05:30
2d83043ef7
+ flake8 in requirements-dev.txt
Akash Mahanty
2022-01-23 02:05:08 +05:30
31b1056217
fix typo in CI
Akash Mahanty
2022-01-23 02:03:30 +05:30
97712b2c1e
add CI unit_test.yml
Akash Mahanty
2022-01-23 02:00:15 +05:30
a8acc4c4d8
Fix Incomplete URL substring sanitization in the last commit.
Akash Mahanty
2022-01-23 01:42:48 +05:30
1bacd73002
created pytest.ini, added test for waybackpy/availability_api.py, new exceptions all of which inherit from the main WaybackError and created requirements-dev.txt
Akash Mahanty
2022-01-23 01:29:07 +05:30
405e9a2a79
waybackpy/save_api.py : Added doc strings and also lint with black.
Akash Mahanty
2022-01-22 00:41:10 +05:30
db551abbf6
lint waybackpy/cdx_api.py and added some doc strings
Akash Mahanty
2022-01-22 00:11:35 +05:30
d13dd4db1a
added notice on waybackpy/wrapper.py that the Url class will cease to exist after 2024-01-01 and also removed unused imports.
Akash Mahanty
2022-01-21 23:14:20 +05:30
d3bb8337a1
make setup.py smarter, so there is no need to update the URL again, and also added more keywords. Also updated the __author__ in __version__.py
Akash Mahanty
2022-01-21 23:01:09 +05:30
66e16d6d89
define __repr__ for the Availability API class
Akash Mahanty
2022-01-18 20:34:21 +05:30
16b9bdd7f9
output the file name if known_url and file flag are passed.
Akash Mahanty
2022-01-18 20:14:44 +05:30
7adc01bff2
implement known_urls for cli from the newer interface. Use of CDX is recommended, but backward compatibility matters.
Akash Mahanty
2022-01-18 20:07:12 +05:30
2ab44391cf
close #107, added link to SecSI/Docker image
Akash Mahanty
2022-01-16 23:01:31 +05:30
cc3628ae18
define __str__ for objects of WaybackMachineAvailabilityAPI class; the check for self.JSON ensures that the API was at least called.
Akash Mahanty
2022-01-16 22:28:12 +05:30
1d751b942b
invoke json; removing it in the earlier commit was a bad idea, as the end user should not have to call it
Akash Mahanty
2022-01-16 22:15:25 +05:30
261a867a21
near() method of WaybackMachineAvailabilityAPI returns self to preserve past behaviour
Akash Mahanty
2022-01-16 21:53:54 +05:30
2e487e88d3
define __len__ on Url objects; if no method was used prior to the len op, default to the len of the oldest archive.
Akash Mahanty
2022-01-16 21:29:43 +05:30
c8d0ad493a
defined __str__ for Url objects, print func should print the url.
Akash Mahanty
2022-01-16 21:22:43 +05:30
09290f88d1
fix one more error
Akash Mahanty
2021-01-24 16:58:53 +05:30
e5835091c9
import re
Akash Mahanty
2021-01-24 16:56:59 +05:30
7312ed1f4f
set cached_save to True if archive older than 3 mins.
Akash Mahanty
2021-01-24 16:53:36 +05:30
6ae8f843d3
add --file to --known_urls
Akash Mahanty
2021-01-24 16:15:11 +05:30
36b936820b
known urls now yields, more reliable. And save the file in chunks with respect to the response. --file arg can be used to create an output file; if --file is not used, no output will be saved in any file. (#88)
Akash Mahanty
2021-01-24 16:11:39 +05:30
a3bc6aad2b
too much API usage by duplicate tests was causing too many test failures
Akash Mahanty
2021-01-23 21:08:21 +05:30
ffe0810b12
flag to check whether the saved archive is older than 30 mins
Akash Mahanty
2021-01-16 12:06:08 +05:30
40233eb115
improve code quality, remove unused imports, use system randomness etc
Akash Mahanty
2021-01-16 11:35:13 +05:30
d549d31421
improve save method; we now know that 302 errors indicate that the Wayback Machine is archiving the URL and hasn't yet archived it. We construct an artificial archive with the current UTC time and check for HTTP status code 20* or 30*. If we verify the archival, we return the artificial archive. The artificial archive will automatically point to the new archive or, in the best case, will be the new archive after some time.
Akash Mahanty
2021-01-16 10:47:43 +05:30
0725163af8
minify the logo, remove ugly old logos
Akash Mahanty
2021-01-15 18:14:48 +05:30
712471176b
better error messages (str), check latest version before asking for an upgrade, and rm alive checking
Akash Mahanty
2021-01-15 16:47:26 +05:30
dcd7b03302
getting rid of C-style str formatting, now using .format
Akash Mahanty
2021-01-14 19:30:07 +05:30
76205d9cf6
backoff_factor=2 for save, incr success by 25%
Akash Mahanty
2021-01-13 10:13:16 +05:30