Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upBroken Russian links - HELP REQUESTED #3269
Open
Labels
Comments
|
Hi @eshellman >>>target_url="https://raw.githubusercontent.com/EbookFoundation/free-programming-books/master/free-programming-books-ru.md"
>>>import requests
>>>data = requests.get(target_url).text
>>>> urls = list(filter(lambda x: x.startswith("https://ru.wikibooks.org/"), data.replace("("," ").replace(")","").split(" ")))
# ['https://ru.wikibooks.org/wiki/Ассемблер_в_Linux_для_программистов_C', 'https://ru.wikibooks.org/wiki/Язык_Си_в_примерах', 'https://ru.wikibooks.org/wiki/Учебник_Python_2.6', 'https://ru.wikibooks.org/wiki/Ruby']
>>> for url in urls:
... print("[{}]:{}, {}".format(url,requests.get(url).status_code, requests.get(url).url))
...
[https:/ru.wikibooks.org/wiki/Ассемблер_в_Linux_для_программистов_C]:200, https:/ru.wikibooks.org/wiki/%D0%90%D1%81%D1%81%D0%B5%D0%BC%D0%B1%D0%BB%D0%B5%D1%80_%D0%B2_Linux_%D0%B4%D0%BB%D1%8F_%D0%BF%D1%80%D0%BE%D0%B3%D1%80%D0%B0%D0%BC%D0%BC%D0%B8%D1%81%D1%82%D0%BE%D0%B2_C
[https:/ru.wikibooks.org/wiki/Язык_Си_в_примерах]:200, https:/ru.wikibooks.org/wiki/%D0%AF%D0%B7%D1%8B%D0%BA_%D0%A1%D0%B8_%D0%B2_%D0%BF%D1%80%D0%B8%D0%BC%D0%B5%D1%80%D0%B0%D1%85
[https:/ru.wikibooks.org/wiki/Учебник_Python_2.6]:200, https:/ru.wikibooks.org/wiki/%D0%A3%D1%87%D0%B5%D0%B1%D0%BD%D0%B8%D0%BA_Python_2.6
[https:/ru.wikibooks.org/wiki/Ruby]:200, https:/ru.wikibooks.org/wiki/RubyAs we can see from python code output all wikibooks pages available. Also I suppose that |
|
looks like encoding problems to me. Unicode characters need to be % encoded in the url. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Below is a list of failing links in our Russian lists
You can help improve this repo by resolving the problems.
A failing link may be a website or resource that has disappeared. You can submit a PR
Sometimes a link is only temporarily failing, or the site blocks link-checkers. If that's the case, leave a note on this issue.
In your PR, reference this issue.
Thanks for your help!
free-programming-books-ru.md