Commit Graph

104 Commits

Author SHA1 Message Date
Akash Mahanty
58cd9c28e7 Threading enabled checking for URLs 2020-11-26 06:15:42 +05:30
Akash Mahanty
5088305a58 removed python2 compatibility code 2020-11-21 17:00:11 +05:30
Akash Mahanty
7f927ec7be added tests for json and archive_url, updated broken tests (#34)
* added tests for json and archive_url, updated broken tests

* drop 2.7 support
2020-10-16 19:25:45 +05:30
danvalen1
91e7f65617 Fixing len() bug (#32)
* added class functionality

* Update wrapper.py

* style edits

* fixed bug with len() of url()

* fixing len() bug

* fixing len() bug

* squashing bug

* removed test notebook
2020-10-16 10:04:13 +05:30
danvalen1
d465454019 Adding attributes to Url class (#28)
* added class functionality

* Update wrapper.py

* style edits
2020-10-15 22:10:32 +05:30
Akash Mahanty
1a81eb97fb lint 2020-10-03 16:58:11 +05:30
Akash Mahanty
ce7294d990 Implemented new feature, known urls for domain. 2020-10-02 20:27:28 +05:30
Akash
ca9186c301 update message, sometimes raised for poor performance by wayback machine even if the url is archived. 2020-08-09 10:43:16 +05:30
Akash
8a4b631c13 new regex to parse archive, IA changed the header again :( 2020-08-09 10:36:25 +05:30
Akash
56116551ac Coverge improvements (#22)
* Update cli.py

* improved tests

* chnages for proper testing

* Type check using isinstance

* Replace elifs with if when used after return

* twitter.com --> www.ibm.com

* fix typo

* test archive urll parser and dunders

* Update test_wrapper.py
2020-07-24 15:31:21 +05:30
Akash
ed24184b99 Remove duplicate get response method 2020-07-24 00:57:22 +05:30
Akash
dee9105794 command_line support (#18)
* Update wrapper.py

* entry points cli

* Suppress the urllib2/3 Exception

* rm cli code, will create a new cli.py file

* Create cli.py

* update cli entry pts

* Update cli.py

* Update cli.py

* import print_function

* Update cli.py

* Update cli.py

* Delete pypi_uploader.sh

* resolve conflicts with the master

* update the test ; resolve the conflicts

* decrease code complexity

* cli method changed to main

* get is not for just local usage

* get method should be available from interface

* get is used in the interface

* Update cli.py
2020-07-22 16:40:13 +05:30
Akash
b3a7e714a5 Update wrapper.py 2020-07-22 10:57:43 +05:30
Akash
cd9841713c Update wrapper.py 2020-07-22 10:52:43 +05:30
AntiCompositeNumber
1ea9548d46 Raise WaybackError from URLError and include URL (#19)
* Raise WaybackError from URLError and include URL

* python2 compatibility

Co-authored-by: Akash <64683866+akamhy@users.noreply.github.com>
2020-07-22 10:51:44 +05:30
AntiCompositeNumber
be7642c837 Code style improvements (#20)
* Add sane line length to setup.cfg

* Use Black for quick readability improvements

* Clean up exceptions, docstrings, and comments

Docstrings on dunder functions are redundant and typically ignored
Limit to reasonable line length
General grammar and style corrections
Clarify docstrings and exceptions
Format docstrings per PEP 257 -- Docstring Conventions

* Move archive_url_parser out of Url.save()

It's generally poor form to define a function in a function, as it will
be re-defined each time the function is run.

archive_url_parser does not depend on anything in Url, so it makes sense
to move it out of the class.

* move wayback_timestamp out of class, mark private functions

* DRY in _wayback_timestamp

* Url._url_check should return None

There's no point in returning True if it's never checked and won't ever
be False.
Implicitly returning None or raising an exception is more idiomatic.

* Default parameters should be type-consistant with expected values

* Specify parameters to near

* Use datetime.datetime in _wayback_timestamp

* cleanup __init__.py

* Cleanup formatting in tests

* Fix names in tests

* Revert "Use datetime.datetime in _wayback_timestamp"

This reverts commit 5b30380865.

Introduced unnecessary complexity

* Move _get_response outside of Url

Because Codacy reminded me that I missed it.

* fix imports in tests
2020-07-22 10:09:14 +05:30
Akash
8fd4462025 Update wrapper.py 2020-07-20 20:17:18 +05:30
Akash
f3bb9a8540 Update wrapper.py 2020-07-20 10:11:36 +05:30
Akash
bb94e0d1c5 Update index.rst and remove dupes 2020-07-20 10:07:31 +05:30
Akash
83c962166d Raise 2020-07-19 23:02:04 +05:30
Akash
8ab116f276 API chnaged again. updated
* Update wrapper.py

* Update wrapper.py

* Update wrapper.py

* Update wrapper.py

* Update wrapper.py

* api changed; fix archive url parser

* Update wrapper.py

* - Trailing whitespace

* include the header in exception
2020-07-19 20:39:07 +05:30
Akash
58d2d585c8 No timeout for final try 2020-07-18 18:29:41 +05:30
Akash
0ad27f5ecc update readme for newer oop and some test changes (#12)
* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* docstrings

* user agent ; more variants

* description update

* Update __init__.py

* # -*- coding: utf-8 -*-

* Update test_1.py

* update docs for get()

* Update README.md
2020-07-18 16:22:09 +05:30
Akash
f2112c73f6 Python 2 support 2020-07-17 21:08:32 +05:30
Akash
9860527d96 OOP (#10)
* Update wrapper.py

* Update exceptions.py

* Update __init__.py

* test adjusted for new changes

* Update wrapper.py
2020-07-17 20:50:00 +05:30
Akash
f881705d00 detecet python version whith sys.version_info (#9) 2020-06-26 15:48:01 +05:30
akamhy
42ac399362 Most efficient method to count (yet) 2020-05-08 09:47:13 +05:30
akamhy
e9d010c793 just count the status code, consumes less memory 2020-05-08 09:28:18 +05:30
akamhy
0c4f119981 Update wrapper.py 2020-05-07 17:25:34 +05:30
akamhy
afded51a04 Update wrapper.py 2020-05-07 17:20:23 +05:30
akamhy
b950616561 Update wrapper.py 2020-05-07 17:17:17 +05:30
akamhy
444675538f fix code Complexity (#8)
* fix code Complexity

* Update wrapper.py

* codefactor badge
2020-05-07 16:51:08 +05:30
akamhy
0ca6710334 Update wrapper.py 2020-05-07 16:24:33 +05:30
akamhy
01a7c591ad retry 2020-05-07 15:46:39 +05:30
akamhy
74d3bc154b fix issue with py2.7 2020-05-07 15:34:41 +05:30
akamhy
8c5c0153da + total_archives() 2020-05-07 14:52:05 +05:30
akamhy
db127a5c54 always return https 2020-05-06 20:16:25 +05:30
akamhy
ed497bbd23 Update wrapper.py 2020-05-06 20:07:25 +05:30
akamhy
45fe07ddb6 Update wrapper.py 2020-05-06 19:35:01 +05:30
akamhy
a6cb955669 Update wrapper.py 2020-05-05 10:04:40 +05:30
akamhy
8acb14a243 Update wrapper.py 2020-05-05 10:00:29 +05:30
akamhy
7d434c3f0f Update wrapper.py 2020-05-05 09:57:39 +05:30
akamhy
057c61d677 Update wrapper.py 2020-05-05 09:48:39 +05:30
akamhy
6705c04f38 Update wrapper.py 2020-05-05 09:43:13 +05:30
akamhy
09b4ba2649 Version 1.2 with bug fixes and support for webpage retrieval (#4) 2020-05-05 09:03:16 +05:30
akamhy
509426a1f7 Update wrapper.py 2020-05-04 16:01:01 +05:30
akamhy
997b654881 Update wrapper.py 2020-05-04 15:33:00 +05:30
akamhy
5ad2886bcb python 2.x support 2020-05-04 09:49:40 +05:30
akamhy
9ab503f4ac Update wrapper.py 2020-05-04 09:01:33 +05:30
akamhy
53561ec9c4 Update wrapper.py 2020-05-04 08:56:01 +05:30