Top Tweets for #parallelcorpus
A fanfare please! OpenSubtitles corpora now star as the collection with the highest number of parallel corpora in Sketch Engine. 60 languages and language varieties from all over the world, 11 billion words in total.
https://t.co/ucMwDteere
#parallelcorpus #corpuslinguistics

We're introducing the new United Nations Parallel Corpus. It consists of the official records of the UN in 6 langs: Arabic, Chinese, English, French, Russian and Spanish. Available to both trial and paying subscribers.
#corpuslinguistics #parallelcorpus
https://t.co/6AJd39PfB8

Try parallel corpora in the open version of Sketch Engine. We published four samples of parallel corpora there.
To get more, sign up for an account and gain access to parallel corpora in 40+ languages.
#corpuslinguistics #parallelcorpus

We almost forgot to tell you, ParaCrawl 8 is out!
First highlight: wow the size of it!
Check yourself at https://t.co/XVwya7g3y7
#ParaCrawl #crawling #parallelcorpus #CEFTelecom #MT

Cleaned with a newer and better version of Bicleaner with three new language pairs. Now also with anonymised version of corpora, ParaCrawl v7 is now available.
https://t.co/ZI6GBmiWwM
#ParaCrawl #crawling #parallelcorpus #CEF #MT
@inea_eu
Read more about @ParaCrawl in our ACL paper https://t.co/89x1vk5vc9 and join the live Q&A session on 7th July at 1400 CEST https://t.co/pcdanDeycd or at 1900 CEST https://t.co/AmkTyka0EP
#ParaCrawl #crawling #parallelcorpus #CEF #MT
@aclmeeting @inea_eu
New language pair English-Icelandic is part of ParaCrawl v6. Also, more data for other languages with restorative cleaning, improved sentence splitting, fixes to encoding and html issues and more.
https://t.co/rhVN4DHwaY
#ParaCrawl #crawling #parallelcorpus #CEF #MT
@inea_eu
Philipp Koehn will talk about the approach to data ownership in web-scale crawling projects such as @Paracrawl in today's @T21Century MT webinar. https://t.co/TwWAUkWArl
#ParaCrawl #crawling #parallelcorpus #CEF #MT
ParaCrawl v5.1 builds upon the same raw corpus as v5, but thanks to filtering improvements its higher in quantity and quality. The official release for #WMT20
https://t.co/ovfEGeJIwd
#ParaCrawl #crawling #parallelcorpus #CEF #MT @inea_eu @emnlp2020
Your 4 mins to learn how to use parallel corpora in Sketch Engine - new video #corpuslinguistics #corpus #parallelcorpus What else would you like to learn? https://t.co/yRhM1nLbG0
ParaCrawl v5 is twice in size than the v4. https://t.co/FWY9QyeBHL
#ParaCrawl #crawling #parallelcorpus #CEF #MT
#InternetArchive partners with #UniversityOfEdinburgh to provide historical web data supporting #MT that is vastly expanding the data mined by #ParaCrawl and therefore the amount of translated sentences collected.
https://t.co/ilcI29naMW
#crawling #parallelcorpus #CEF
We have just released two bonus corpora for language pairs #PolishGerman and #DutchFrench, crawled in collaboration with an industry partner. For more details visit: https://t.co/mhAtFpd46f
#ParaCrawl #crawling #parallelcorpus #CEF
Last Seen Hashtags on Sotwe
Most Popular Users

Elon Musk 
@elonmusk
240.5M followers

Barack Obama 
@barackobama
119.3M followers

Donald J. Trump 
@realdonaldtrump
111.7M followers

Cristiano Ronaldo 
@cristiano
110.3M followers

Narendra Modi 
@narendramodi
107M followers

Rihanna 
@rihanna
97.6M followers

NASA 
@nasa
92.1M followers

Justin Bieber 
@justinbieber
90.8M followers

KATY PERRY 
@katyperry
87.5M followers

Taylor Swift 
@taylorswift13
81.3M followers

Lady Gaga 
@ladygaga
72.9M followers

Kim Kardashian 
@kimkardashian
69.7M followers

Virat Kohli 
@imvkohli
69.6M followers

YouTube 
@youtube
68.7M followers

Bill Gates 
@billgates
63.8M followers

The Ellen Show
@theellenshow
62.5M followers

Neymar Jr 
@neymarjr
62.3M followers

CNN 
@cnn
61.9M followers

X 
@x
60.8M followers

Selena Gomez 
@selenagomez
60.6M followers


