Tag: corpora

24 Lists of linguistic resources 2011-09-27T17:42:09.257

19 Corpus of baby-talk or motherese 2011-09-16T08:30:48.160

16 English text corpus for download 2013-08-22T10:49:48.407

15 Does anyone know of text message corpora? 2012-02-13T16:58:20.213

14 Are there natural languages that do not obey Zipf's law? 2014-01-10T18:32:55.637

11 What are some resources that I can use to gather Twitter data for an NLP project? 2011-10-03T20:16:52.787

10 The power of trigram language models (2nd order Markov models) 2012-10-05T21:03:14.467

10 Best method for building a learner corpus for DDL 2013-06-03T22:46:35.687

9 How to work on annotating AND sentence-aligning parallel texts? 2012-01-16T00:52:43.320

9 Is there an automatic way of identifying transitive verbs in Computational Linguistics? 2012-04-13T23:30:31.060

9 What is the Relationship Between Document Length and Unique Words 2012-12-10T21:08:22.500

8 Where to find frequency list of English words from newspapers, books and magazines? 2012-06-17T16:05:30.387

8 Tools to annotate (categorise) sentences from a sentence corpus 2013-01-20T12:49:35.570

7 Are there dictionaries like Collins COBUILD for other languages than English? 2011-09-17T14:18:29.277

7 Corpus of Chat/IM/Text Conversations? 2014-04-28T19:22:31.053

7 Do I have copyright issues when making a corpus from the web? 2014-10-17T00:35:41.710

7 How can I calculate if the difference between two word frequencies in one corpus is significant? 2016-11-27T17:32:36.930

6 Where could I find a corpus that is purely descriptive in nature and limited in scope? 2011-09-15T20:52:29.403

6 Is there a digital corpus somewhere of pre-Latin Vietnamese text? 2012-12-22T02:06:29.560

6 Corpus Linguistics: Is it possible to add a tag for "sentence ending"? 2016-10-11T16:04:41.787

5 Words and phrases more likely in everyday speech 2013-03-10T18:29:32.980

5 Semantic frame representation of ATIS 3 corpora 2013-04-21T20:04:10.940

5 How to establish word frequencies from a corpus? 2013-09-12T08:09:34.213

5 Tool for building a corpus by crawling the web? 2013-12-17T04:01:07.067

5 Ranking sentences 2014-01-05T13:20:50.087

5 How should grammatical errors be treated in parsing and tokenisation? 2014-10-26T13:47:53.167

5 Parts-of-speech tagging and finding relevant phrases in documents 2016-04-27T23:49:00.200

5 How can I automatically build a domain specific corpus from scratch? 2016-12-19T11:17:17.793

5 Is there a corpus of Arabic text that doesn't include quotations of the Quran? 2017-01-20T22:45:03.163

5 Where can I find training data for dialects of Hindi? 2017-04-04T05:21:41.530

5 Tools and algorithms for finding non-verbatim quotations 2018-01-05T12:03:08.770

4 How to decrease CRF++ feature function set? 2011-11-12T17:01:52.587

4 Is there a standard corpus against which to benchmark mechanical parsers? 2012-03-14T07:29:12.193

4 Corpus-making guide in Language Understanding Context 2012-10-24T19:12:49.153

4 Tool for manually POS tagging texts 2012-10-29T20:02:42.917

4 A classical text about the Sun? 2012-12-22T08:31:02.197

4 Database of synonym gradients 2013-01-01T19:12:38.160

4 Corpora of Indigenous American Languages? 2013-04-03T04:54:42.960

4 Automated methods to align text 2013-04-03T09:11:48.797

4 Word list sought based on corpus of 19th century scientific English 2013-08-12T21:53:59.303

4 Is there a corpus of English adjective, adverb, etc metadata? 2013-11-27T06:16:47.837

4 Is there any corpus for technical English? (E.g., computers, IT, modern technology) 2014-12-16T09:24:12.053

4 Looking for korean text corpus 2015-11-14T13:55:42.447

4 Distribution of the set of meanings of a given word, in a corpus 2016-10-04T09:41:21.570

4 Classifying the verbs in a small corpus 2017-04-11T15:02:32.093

4 Where can I find a free text corpus for the Hindi language? 2017-06-22T10:39:33.770

3 Word frequency list for agglutinative languages like Swahili? 2012-08-01T19:53:36.877

3 Dimensional reduction with synonomy and polysemy 2012-08-08T17:44:47.147

3 Is there a computational method to syllabify English words? 2012-09-06T04:18:44.930

3 Getting familiar with accents 2013-01-07T14:19:29.510

3 Symbols that compose a word 2013-09-23T11:39:44.460

3 How are dictionaries produced 2013-10-06T19:26:24.897

3 Where can I find Japanese-English (manually) word-aligned corpora? 2013-11-25T13:18:59.120

3 Corpus analysis program 2013-11-29T13:38:10.827

3 Is there a web-based corpus tools that I can upload and use with my own corpus? 2014-05-10T11:52:50.707

3 Is there any corpus for idioms? 2014-06-14T23:26:34.237

3 Probabilities for 2-grams are higher than 1-grams in arpa file produced by kenlm 2014-07-20T02:35:48.663

3 Time annotated corpus: plain text english corpus 2014-09-29T16:26:30.313

3 Conversational English corpus for download 2015-03-04T01:19:50.597

3 Different discounting methods with SRILM toolikt 2015-04-06T08:53:09.940

3 Question type corpus 2015-06-20T23:19:42.410

3 Slot filling corpora 2015-08-24T13:04:42.990

3 American English Corpus with spoken language exchange 2015-11-07T10:17:13.843

3 Are there any publicly available spell checking corpora? 2016-05-27T07:46:47.147

3 Corpus Linguistics: How do I compare date from two corpora correctly? 2016-11-16T23:32:51.600

3 where can I find free corpus of spoken disabled people (in english, italian) 2018-01-10T20:29:29.617

3 Phonetically annotated speech corpus 2018-02-14T17:54:45.803

3 glossary/dictionary corpus for NLP task 2018-02-22T16:58:34.233

3 Is there a frequency-ranked corpus of Punjabi NAV lemmata? 2018-02-22T17:30:43.060

2 Query format for NP without subject in PPCEME 2012-04-15T11:53:45.050

2 Do generative linguists use spoken-word corpora? 2013-03-29T03:03:26.983

2 Generalisations which a bi-gram probabilistic model might infer from a dataset 2013-05-18T12:39:27.243

2 Online bigram frequency lookup 2013-05-24T11:38:25.977

2 Annotation agreement for multiple category assignment 2013-08-25T08:19:33.420

2 Semi-automatic annotation tool like '@nnotate' 2013-11-05T15:16:06.417

2 I am looking for an Arabic ngram corpus 2014-02-07T13:33:18.087

2 Comparing frequencies of two corpora 2014-05-25T08:28:07.627

2 What are the subsets of the Eijiro corpus about? 2014-06-29T14:47:40.030

2 What is considered the smallest possible sample size for word frequency lists used in FL instruction? 2014-08-21T17:38:47.437

2 How to determine difficulty of a word if its frequency in a corpus is known? 2014-12-06T01:29:52.137

2 Biggest freely available English corpus? 2015-05-10T20:26:36.173

2 Do standard corpus analysis tools like AntConc and Wordsmith work for foreign languages? 2015-07-11T21:19:10.717

2 What are some canonical or seminal corpus linguistic studies using the Brown Corpus? 2015-08-27T16:43:58.313

2 Differences in collective nouns and agreement between Ame and BrE 2016-04-05T12:09:57.687

2 How to get data for ACADEMIC + SPOKEN (Academic speech) simultaneously using BNC? 2016-04-22T15:09:05.273

2 I am building a chatbot and I need corpora 2016-05-06T03:02:26.170

2 Is the Philippine corpus known as Palito still available for download online? 2016-05-31T21:29:23.333

2 Open corpora for modern English 2016-07-14T14:08:13.570

2 Where to find/purchase an offline (downloadable) Arabic corpus 2016-07-28T10:58:25.020

2 Comparative Methodology 2016-08-11T17:43:41.643

2 Phonetics - English Pronunciation of Vowels Corpus 2016-09-27T19:52:32.200

2 building a corpus from COCA KWIC 2017-01-21T10:51:53.750

2 Command line tools for querying corpuses 2017-07-26T17:42:17.990

1 What is the formula for Usage Rate? 2012-12-20T02:19:03.020

1 How is the dependency of corpus size and observed ngrams? 2013-09-12T11:55:03.910

1 What are lexical and morpho-syntactic alternations? 2014-08-18T12:36:05.237

1 Is there a data set of elementary typical phrases translated in different languages? 2014-09-01T12:49:55.480

1 Is it possible to build thesaurus automatically? 2015-01-16T08:51:40.257

1 Evaluate idea to autobuild russian-english parallel corpus 2015-01-26T07:53:08.457