Corpus Korpus PWr – statistics and info
Counts | |
---|---|
Tokens | 352997 |
Words | 277947 |
Sentences | 23505 |
Paragraphs | 11644 |
Documents | 1290 |
General info | |
---|---|
Language | Polish |
Encoding | UTF-8 |
Compiled | 02/11/2015 13:55:09 |
Tagset doc | Description |
Infolink | More info |
Lexicon sizes | |
---|---|
word | 60311 |
lexem | 27621 |
tag | 954 |
Structures and attributes
- doc 1290
-
date 10
2011-10-08 580 2011-08-12 311 1970-01-01 90 2012-07-26 71 2011-08-20 69 2011-08-19 65 2012-01-27 53 2012-02-23 40 2011-10-24 10 1858-01-01 1 -
subcorpus 12
wikipedia 360 kap 213 dap 102 blogi 99 dialog 90 proza dawna 87 urzędowe 80 stenogramy 78 ustawy 74 popularno-naukowe i podręczniki 64 proza_wspolczesna 34 techniczne 9 -
author 15
Mirosław Tomaszewski 18 Paweł Grabowski 11 Magdalena Zarębska 9 Tomasz Miller 6 Piotr Sobolczyk 6 Wojciech Charewicz 4 Gabriel Marczak 4 Katarzyna Agata Kiełbińska 3 Mateusz Droba 3 Artur Beling 3 Michał Semeń 2 Jacek Środa 1 Emilia Kaczmarek 1 Jan Murzy Tarak Buczacki 1 -
subject 156
-
title 1212
-
source 1254
-
id 1290
-
keywords 6282
-
- p 0
- s 0
- g 0