presenting an article using pre-formatted template, useful for channels having a basic same content's information before every post.
This is a project on comprehensive data proxy to real-world knowledge for providing structured access to heterogenous web data from various sources clearly emphasizing simplicity and interoperability to make the webpages inspectable at ease.
Just text? No! It will support media files like pictures and audio as well.
Forwarded from @Red
The idea I had is to send an already formatted text to the bot and automatically create the article with the text already formatted
📃Display a table of contents [toc] of the article, by extracting:
- Headings and subheadings (h1,h2,h2)
- Lists (ul, ol)
- Other hierarcghical html elements (dl, dd, table?)
Problems:
===
- Detect existing toc for articles that already have them (e.g. wikipedia) and
1. Ignore or
2. Reuse
- False positives (useless headings),
- maybe just check for headings inside
- fuzzy matching by class or other attribute?
- try to use the xpath from IV competition
- quality metrics
- False negatives (missed headings or toc)
- article not organized (so quality metrics)
- headings not in h1, h2, etc.. tags
- headings nested badly (using all h1 and div for ex.)
- Headings and subheadings (h1,h2,h2)
- Lists (ul, ol)
- Other hierarcghical html elements (dl, dd, table?)
Problems:
===
- Detect existing toc for articles that already have them (e.g. wikipedia) and
1. Ignore or
2. Reuse
- False positives (useless headings),
- maybe just check for headings inside
<article>
elements?)- fuzzy matching by class or other attribute?
- try to use the xpath from IV competition
- quality metrics
- False negatives (missed headings or toc)
- article not organized (so quality metrics)
- headings not in h1, h2, etc.. tags
- headings nested badly (using all h1 and div for ex.)
Forwarded from 🤖 Bot Ideas ¦ Collab ✍️ (Channel Hash Bot)
#request #idea (user-)bot to summarize what goes on in a group.
@topics_bot was this, but it no longer works?
limitations:
- normal bots can't see previos messages posted before it was aded to the group
features:
* participants (count + main ppl)
* topics (words / phrases by rank(num))
* high level (conversation count?) (how?)
* other metadata:
- counts
- entity summary (links, media?)
* ignore noise? (shitposting?)
@topics_bot was this, but it no longer works?
limitations:
- normal bots can't see previos messages posted before it was aded to the group
features:
* participants (count + main ppl)
* topics (words / phrases by rank(num))
* high level (conversation count?) (how?)
* other metadata:
- counts
- entity summary (links, media?)
* ignore noise? (shitposting?)
Forwarded from Somebody (+/%#)::'not j'
stylus-sumitup-2019-08-18.json
1.2 KB
User style to highlight #hash linkable headers and permalinks
TF-IDF based keyword extraction tool, together with a chrome extension, aiming at helping read web pages.
server side + chrome extension
https://github.com/wuyihao14/keyxtractor
server side + chrome extension
https://github.com/wuyihao14/keyxtractor
GitHub
ampresent/keyxtractor
TF-IDF based keyword extraction tool, together with a chrome extension, aiming at helping read web pages. - ampresent/keyxtractor
How Do You Classify Everything?
- Dewey Decimal Classification (DDC) (old library system, mostly US/Western)
Alternatives:
- Comparison of Dewey and Library of Congress subject classification (old, mainly US)
- Universal Decimal Classification (old, but perhaps better than dewey)
- Colon classification (old but interesting because facets)
- Bliss bibliographic classification (old, UK, books, more facets than colon)
- BSO - Broad System of Ordering (modern?)
# todo:
- more research, google scholar
- alternative ideas based on abstract/mathematical/universal facets
- binary trees?
#universal #classification #classify #categorize #category #catalogue
- Dewey Decimal Classification (DDC) (old library system, mostly US/Western)
Alternatives:
- Comparison of Dewey and Library of Congress subject classification (old, mainly US)
- Universal Decimal Classification (old, but perhaps better than dewey)
- Colon classification (old but interesting because facets)
- Bliss bibliographic classification (old, UK, books, more facets than colon)
- BSO - Broad System of Ordering (modern?)
# todo:
- more research, google scholar
- alternative ideas based on abstract/mathematical/universal facets
- binary trees?
#universal #classification #classify #categorize #category #catalogue
Wikipedia
List of Dewey Decimal classes
Wikimedia list article
Perhaps Zipf's law and the Pareto principle can be useful as a measure of quality.. in multiple ways.. https://youtu.be/fCn8zs912OE
#idea #suggestion
#idea #suggestion
YouTube
The Zipf Mystery
Support Vsauce, your brain, Alzheimer's research, and other YouTube educators by joining THE CURIOSITY BOX: a seasonal delivery of viral science toys made by Vsauce! A portion of all proceeds goes to Alzheimer's research and our Inquisitive Fellowship, a…