Forwarded from Bot Ideas ¦ for #devs
⏩ Proper Website Scraper
A bot that simply scrapes the text off a website, formatting it into Telegram-compatible html and emoji markup. That would be great!
If anyone is looking for #project #ideas, this might be something cool. One might use https://github.com/aaronsw/html2text and requests to scrape and convert the content, and then replace markdown or remaining html tags such as
◻️Headlines
and
▪️bullet points
with emojified formatting options.
This could then send the content as a message or push to to an instant-view compatible service (telegra.ph).
Then, on top of that, make html selectors filterable - so that you can scrape only the
A bot that simply scrapes the text off a website, formatting it into Telegram-compatible html and emoji markup. That would be great!
If anyone is looking for #project #ideas, this might be something cool. One might use https://github.com/aaronsw/html2text and requests to scrape and convert the content, and then replace markdown or remaining html tags such as
◻️Headlines
and
▪️bullet points
with emojified formatting options.
This could then send the content as a message or push to to an instant-view compatible service (telegra.ph).
Then, on top of that, make html selectors filterable - so that you can scrape only the
code-formatted
text blocks in an online tutorial for quicker references and in order to share them with colleagues.GitHub
GitHub - aaronsw/html2text: Convert HTML to Markdown-formatted text.
Convert HTML to Markdown-formatted text. Contribute to aaronsw/html2text development by creating an account on GitHub.