sumitup <ideas>
ideas box

Chat group: @sumitupchat

Bot: @sumitupbot
A way to add Pepe the Frog memes to an article as GIFs
Some way to embed a location (a map, I mean)
Ability to export all your articles as a zip containing all files
A reliable way to use markdown and HTML in a post's content
Presenting an article using a pre-formatted template; useful for channels that put the same basic information before every post.
This project is a comprehensive data proxy to real-world knowledge. It provides structured access to heterogeneous web data from various sources, emphasizing simplicity and interoperability so that web pages are easy to inspect.
TL;DR: Making consumption of the web suitable for busy humans.
Just text? No! It will support media files like pictures and audio as well.
Coming soon.
Forwarded from @Red
The idea I had is to send an already formatted text to the bot and automatically create the article with the text already formatted
📃Display a table of contents [toc] of the article, by extracting:
- Headings and subheadings (h1, h2, h3)
- Lists (ul, ol)
- Other hierarchical html elements (dl, dd, table?)

Problems:
===
- Detect an existing toc for articles that already have one (e.g. wikipedia) and
  1. Ignore or
  2. Reuse
- False positives (useless headings)
  - maybe just check for headings inside <article> elements?
  - fuzzy matching by class or other attribute?
  - try to use the xpath from IV competition
  - quality metrics
- False negatives (missed headings or toc)
  - article not organized (so quality metrics)
  - headings not in h1, h2, etc. tags
  - headings nested badly (using all h1 and div, for example)
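
A rough sketch of the heading part of this idea, using only Python's standard library. The names `TocParser`, `extract_toc` and `render_toc` are made up for illustration, and the `only_in_article` switch is just one possible take on the "check inside <article>" idea above. It does not handle lists, existing tocs, or headings that are not in h1..h6 tags.

```python
from html.parser import HTMLParser

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class TocParser(HTMLParser):
    """Collect (level, text) pairs for h1..h6 headings."""

    def __init__(self, only_in_article=False):
        super().__init__()
        self.only_in_article = only_in_article  # hypothetical false-positive filter
        self.article_depth = 0     # how many <article> elements we are inside
        self.current_level = None  # level of the heading being read, if any
        self.buffer = []           # text fragments of the current heading
        self.entries = []          # collected (level, text) pairs

    def handle_starttag(self, tag, attrs):
        if tag == "article":
            self.article_depth += 1
        elif tag in HEADING_TAGS:
            if not self.only_in_article or self.article_depth > 0:
                self.current_level = int(tag[1])
                self.buffer = []

    def handle_endtag(self, tag):
        if tag == "article" and self.article_depth > 0:
            self.article_depth -= 1
        elif tag in HEADING_TAGS and self.current_level is not None:
            # Collapse whitespace and drop empty headings.
            text = " ".join("".join(self.buffer).split())
            if text:
                self.entries.append((self.current_level, text))
            self.current_level = None

    def handle_data(self, data):
        if self.current_level is not None:
            self.buffer.append(data)


def extract_toc(html, only_in_article=False):
    parser = TocParser(only_in_article)
    parser.feed(html)
    parser.close()
    return parser.entries


def render_toc(entries):
    # Indent two spaces per heading level below h1.
    return "\n".join("  " * (level - 1) + "- " + text for level, text in entries)
```

Something like `render_toc(extract_toc(page_html, only_in_article=True))` would then give a plain indented list that could be placed at the top of the article.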
https://www.textcompactor.com/
POST params:
textin: some text here
percentage: 16
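
A sketch of how that request might be built in Python. Only the URL and the two parameter names come from the note above. Assumptions: the form is posted to the site root, the body is form-encoded, and `build_compactor_request` is a made-up helper name. Nothing is sent here, and the response format is unknown.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_compactor_request(text, percentage=16,
                            url="https://www.textcompactor.com/"):
    # Assumption: a plain form-encoded POST body with the two noted params.
    body = urlencode({"textin": text, "percentage": percentage}).encode("utf-8")
    return Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )
```

Passing the result to `urllib.request.urlopen` would actually send it; whether the site accepts such a request would need to be checked by hand first.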
Forwarded from 🤖 Bot Ideas ¦ Collab ✍️ (Channel Hash Bot)
#request #idea (user-)bot to summarize what goes on in a group.

@topics_bot was this, but it no longer works?

limitations:
- normal bots can't see previous messages posted before they were added to the group

features:
* participants (count + main ppl)
* topics (words / phrases by rank(num))
* high level (conversation count?) (how?)
* other metadata:
  - counts
  - entity summary (links, media?)
* ignore noise? (shitposting?)
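
A minimal sketch of the participants / topics / links part, assuming the messages are already available as (sender, text) pairs. How they are collected is a separate problem, given the limitation above. `summarize_chat`, the stopword list and the word regex are all illustrative choices, not a tested design.

```python
import re
from collections import Counter

# Tiny illustrative stopword list; a real one would be much longer.
STOPWORDS = {"the", "and", "for", "that", "this", "with", "you", "are", "was"}
LINK_RE = re.compile(r"https?://\S+")
WORD_RE = re.compile(r"[a-z']{3,}")


def summarize_chat(messages, top_n=3):
    """messages: list of (sender, text) tuples."""
    senders = Counter(sender for sender, _ in messages)
    words = Counter()
    links = []
    for _, text in messages:
        links.extend(LINK_RE.findall(text))
        # Strip links before counting words so URLs don't become "topics".
        cleaned = LINK_RE.sub(" ", text).lower()
        words.update(w for w in WORD_RE.findall(cleaned) if w not in STOPWORDS)
    return {
        "message_count": len(messages),
        "participant_count": len(senders),
        "main_participants": senders.most_common(top_n),
        "topics": words.most_common(top_n),
        "links": links,
    }
```

Ranking single words by raw count is the crudest possible notion of "topics"; phrases, conversation counts and noise filtering are not covered here.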
Forwarded from Somebody (+/%#)::'not j'
stylus-sumitup-2019-08-18.json
1.2 KB
User style to highlight #hash linkable headers and permalinks
TF-IDF based keyword extraction tool, together with a chrome extension, aiming at helping read web pages.

server side + chrome extension

https://github.com/wuyihao14/keyxtractor
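
For reference, a bare-bones TF-IDF keyword ranking in plain Python. This is not the keyxtractor code, just a sketch of the general technique with one common weighting (term frequency times log of inverse document frequency); `tokenize` and `tfidf_keywords` are made-up names.

```python
import math
import re
from collections import Counter


def tokenize(text):
    return re.findall(r"[a-z]+", text.lower())


def tfidf_keywords(docs, doc_index, top_n=3):
    """Rank the words of docs[doc_index] by TF-IDF against all docs."""
    tokenized = [tokenize(d) for d in docs]
    # Document frequency: in how many documents each word appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))
    target = tokenized[doc_index]
    tf = Counter(target)
    n = len(docs)
    scores = {
        term: (count / len(target)) * math.log(n / df[term])
        for term, count in tf.items()
    }
    # Highest score first; ties broken alphabetically for stable output.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]
```

Words that appear in every document get a score of zero, which is what pushes common words like "the" out of the keyword list without needing a stopword list.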
How Do You Classify Everything?

- Dewey Decimal Classification (DDC) (old library system, mostly US/Western)

Alternatives:
- Library of Congress Classification (see: Comparison of Dewey and Library of Congress subject classification) (old, mainly US)
- Universal Decimal Classification (old, but perhaps better than dewey)
- Colon classification (old but interesting because facets)
- Bliss bibliographic classification (old, UK, books, more facets than colon)
- BSO - Broad System of Ordering (modern?)

# todo:
- more research, google scholar
- alternative ideas based on abstract/mathematical/universal facets
- binary trees?

#universal #classification #classify #categorize #category #catalogue