WaG - Word at a Glance

- Use existing corpus and data search and retrieval software as a backend.
- Obtain and compile information about:
  1. a single word,
  2. two or more words compared with each other,
  3. a word's translation.
- Explore text metadata statistics, time-based trends, word cloud-based data, and more.
- Combine statistics from different corpora.
- Use the results of one resource as input for another resource.
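
To illustrate what combining statistics from different corpora can involve, here is a minimal sketch that normalizes absolute frequencies to instances per million tokens (ipm) so that corpora of different sizes become comparable. The `CorpusFreq` and `NormalizedFreq` types, the `toIpm` and `mergeFrequencies` functions, and the corpus names are all hypothetical and are not taken from WaG's code base.

```typescript
// Hypothetical shape of one backend's answer; NOT WaG's actual data model.
interface CorpusFreq {
    corpus: string;      // corpus identifier (assumed)
    absFreq: number;     // absolute number of hits for the word
    corpusSize: number;  // total number of tokens in the corpus
}

// Hypothetical normalized result used for cross-corpus comparison.
interface NormalizedFreq {
    corpus: string;
    ipm: number;         // instances per million tokens
}

// Convert an absolute frequency to instances per million tokens.
function toIpm(absFreq: number, corpusSize: number): number {
    if (corpusSize <= 0) {
        throw new RangeError('corpusSize must be positive');
    }
    return (absFreq * 1000000) / corpusSize;
}

// Normalize results from several corpora and sort them by ipm, highest first.
function mergeFrequencies(results: CorpusFreq[]): NormalizedFreq[] {
    return results
        .map(r => ({ corpus: r.corpus, ipm: toIpm(r.absFreq, r.corpusSize) }))
        .sort((a, b) => b.ipm - a.ipm);
}

// Example with made-up numbers.
const merged = mergeFrequencies([
    { corpus: 'corpusA', absFreq: 120, corpusSize: 2000000 },
    { corpus: 'corpusB', absFreq: 50, corpusSize: 500000 },
]);
console.log(merged);
```

Normalizing before merging matters because raw hit counts from a large corpus would otherwise dominate those from a small one.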

Currently supported resources

KonText
MQuery
NoSketch Engine
Treq
Clarin FCS Core 1
Datamuse API
Leipzig Corpora Collection (REST API) (LCC)

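|  | WaG | KonText | MQuery | NoSkE | Treq | Clarin FCS | Datamuse | ElasticSearch | LCC |
|---|---|---|---|---|---|---|---|---|---|
| collocations |  | ⭐ | 🚧 | ⭐ |  |  |  |  | ⭐ |
| concFilter |  | ⭐ |  |  |  |  |  |  |  |
| concordance |  | ⭐ |  | ⭐ |  | ⭐ |  |  | ⭐ |
| freqBar |  | ⭐ |  | ⭐ |  |  |  |  |  |
| freqComparison |  | ⭐ |  | ⭐ |  |  |  |  |  |
| freqPie |  | ⭐ |  | ⭐ |  |  |  |  |  |
| geoAreas |  | ⭐ |  | ⭐ |  |  |  |  |  |
| multiWordGeoAreas |  | ⭐ |  | ⭐ |  |  |  |  |  |
| html |  | ⭐ |  | ⭐ |  |  |  |  |  |
| matchingDocuments |  | ⭐ |  |  |  |  |  | ⭐ |  |
| mergeCorpFreq |  | ⭐ |  | ⭐ |  |  |  |  |  |
| speeches |  | ⭐ |  |  |  |  |  |  |  |
| syntacticColls |  |  | ⭐ |  |  |  |  |  |  |
| timeDistrib |  | ⭐ |  | ⭐ |  |  |  |  |  |
| multiWordtimeDistrib |  | ⭐ |  | ⭐ |  |  |  |  |  |
| translations |  |  |  |  | ⭐ |  |  |  |  |
| treqSubsets |  |  |  |  | ⭐ |  |  |  |  |
| wordForms | ⭐ | ⭐ | 🚧 |  |  |  |  |  |  |
| wordFreq | ⭐ | ⭐ | 🚧 |  |  |  |  |  |  |
| wordSim | ⭐ |  | 🚧 |  |  |  | ⭐ |  | ⭐ |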

Requirements

WaG can run either as a self-hosted application or embedded in a compatible web page. The self-hosted variant requires:

- Node.js and the npm package manager
- an HTTP proxy server (Nginx, HAProxy, Apache)
- a core word frequency database, one of:
  - CouchDB (data can be generated from a corpus vertical file using CNC-MASM)
  - KorpusDB (CNC's own service)
  - KonText
  - SQLite3

For more information, please refer to INSTALL.md.

How to cite WaG

Tomáš Machálek (2020): Word at a Glance: Modular Word Profile Aggregator. In: Proceedings of LREC 2020, pp. 7011–7016.

@InProceedings{machalek2020lrec,
 author = {Tomáš Machálek},
 title = "{Word at a Glance: Modular Word Profile Aggregator.}",
 booktitle = {Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020)},
 year = {2020},
 publisher = {European Language Resources Association (ELRA)},
 language = {english}
}
