noe casas @noecasas - Twitter Profile

noecasas retweeted

11 months ago

La plataforma Langtern empra la IA per ajudar els professors a crear exercicis a partir de textos i vídeos en línia, que classifica per nivell de dificultat https://t.co/HEj7lWfqiZ

0

1

0

156

noe casas @noecasas

over 2 years ago

You can find the slides and jupyter notebooks of my talk at PyDayBcn "Intro to neural nets with PyTorch" at https://t.co/Rc1uFtmyMP @PyBCN

2

19

7

2

2K

noecasas retweeted

Marina Moro López @marinamorolopez

over 2 years ago

Hoy es el #PyDayBCN 👩🏻‍💻🐍 De momento aquí estoy aprendiendo un poquito de PyTorch gracias a @noecasas 🤭

1

14

2

0

1K

noe casas @noecasas

over 2 years ago

@erogol I was planning to get familiar with ggml at some point, but for now I have been just lurking and tracking its progress. There is already a TTS for ggml: https://t.co/J8n5Az2FMk . Maybe parts of it can be useful for a future "xtts.cpp"

1

0

39

Who to follow

Oscar Mañas

@oscmansan

Research scientist at @AIatMeta, PhD from @Mila_Quebec @UMontrealDIRO. Working on multimodal vision+language generation & evaluation. Català a Zúric.

IWSLT

@iwslt

The International Conference on Spoken Language Translation & SIGSLT. Join us for the 22nd edition of IWSLT on 31 July-1 Aug 2025, co-located with ACL!

about 3 years ago

@Azure will your prices reflect the same reduction on gpt-3.5-turbo? Will the new models be available via Azure Cognitive Services?

OpenAI

@OpenAI

about 3 years ago

GPT-4 and GPT-3.5 Turbo models in the API now support calling your custom functions, allowing the model to use tools you design for it. Also — reduced pricing & new model versions (including 16k context for 3.5 Turbo): https://t.co/dalfgEQ9k2

416

5K

1K

549

2M

2

0

44

noecasas retweeted

Python Barcelona @PyBCN

about 3 years ago

@nataliasirera The PyDataBCN 2023 has been incredible! We are grateful for all the organizers, volunteers, speakers, sponsors and assistants. See you at our next event!

PyBCN's tweet photo. @nataliasirera The PyDataBCN 2023 has been incredible!

We are grateful for all the organizers, volunteers, speakers, sponsors and assistants.

See you at our next event! https://t.co/17AFDXLaTE

0

14

6

0

2K

noecasas retweeted

Marta R. Costa-jussa @costajussamarta

about 3 years ago

We are pleased to share a multilingual extension to holistic bias dataset which allows to unveil demographic biases for languages at scale, see our first findings: https://t.co/Az8rYZdAaQ

0

8

1

0

655

noe casas @noecasas

over 3 years ago

@Baidu_Inc @BaiduResearch 连普通的ERNIE接口还不可以正确使用。一位百度代表给我说明了我现在不能购买机器资源是因为：由于他们现在机器资源有限，为了保证用户体验，ERNIE 3.0暂时不售卖了。请增添机器资源来正确提供ERNIE的服务。

0

1

0

378

noe casas @noecasas

over 3 years ago

@gdb • Longer context • Random seed parameter • Tiktoken in JS/wasm to count tokens correctly (https://t.co/yhAObbmKZQ) • Models tailored to specific languages, to use fewer tokens to represent the same text, e.g. for Chinese chars

0

49

noecasas retweeted

Langtern @langternapp

over 3 years ago

New version of Langtern with improved flashcards! • Auto-advance mode • Can go back to the previous cards • In Chinese, flashcards can play the word pronunciation • Pronunciation played automatically in auto-advance mode Great to review vocabulary while doing something else!

0

1

0

166

noe casas @noecasas

over 3 years ago

@ramsri_goutham @OpenAI To count tokens, I was previously relying on JS package gpt-3-encoder, recommended by OpenAI, which matches the tokenizer web tool. After switching to tiktoken, my estimation and the actually consumed tokens matched perfectly.

0

1

0

54

noe casas @noecasas

over 3 years ago

@ramsri_goutham @OpenAI I found out at https://t.co/wxoD3UHw6R after seeing large discrepancies in the actually consumed tokens and my estimation

1

0

46

noe casas @noecasas

over 3 years ago

The situation has improved a bit with the new cl100k_base tokenizer used in the new gpt-3.5-turbo models

0

1

0

86

noe casas @noecasas

over 3 years ago

@gdb: is in @OpenAI 's plans to mitigate this issue? (e.g. offering models that are trained on vocabulary that represents other scripts more efficiently). My case is with simplified Chinese, which has approx num tokens = 2 x num chars

Ramsri Goutham Golla

@ramsri_goutham

over 3 years ago

Most people don't understand @OpenAI GPT-3's tokenization and how expensive/inefficient it is to build a GPT-3 app in non-English! English: John - 1 token Telugu: స్తి - 12 tokens 🤯 Byte pair encoding divides a character like స్తి further! స్తి (12 tokens)= స్త + ి (8+4)

ramsri_goutham's tweet photo. Most people don't understand @OpenAI GPT-3's tokenization and how expensive/inefficient it is to build a GPT-3 app in non-English!

English: John - 1 token
Telugu: స్తి - 12 tokens 🤯

Byte pair encoding divides a character like స్తి further!

స్తి (12 tokens)= స్త + ి (8+4) https://t.co/vQEAMVdRvj

29

561

62

128

173K

1

2

0

162

noecasas retweeted

Langtern @langternapp

over 3 years ago

Do not hesitate to drop us a line at [email protected] if you have any doubts or feedback to share! 6/6

0

1

0

81

noecasas retweeted

Langtern @langternapp

over 3 years ago

You can also extract the whole vocabulary (by HSK level) of a video or an essay to preview it, and save it as flashcards to review later. 5/6

langternapp's tweet photo. You can also extract the whole vocabulary (by HSK level) of a video or an essay to preview it, and save it as flashcards to review later. 5/6 https://t.co/H4ricLSizt

1

0

1

0

110

noecasas retweeted

Langtern @langternapp

over 3 years ago

And the new main feature: the online content search engine, where you can search videos and essays by keywords/content but also by HSK level and duration/length! Check it out! 4/6

langternapp's tweet photo. And the new main feature: the online content search engine, where you can search videos and essays by keywords/content but also by HSK level and duration/length! Check it out! 4/6 https://t.co/UfI0qgITbS

1

0

1

0

59

noecasas retweeted

Langtern @langternapp

over 3 years ago

In the Online Content, we have also added social media accounts in Chinese from Twitter and Weibo, web novels, short stories and podcasts with transcription. 3/6

langternapp's tweet photo. In the Online Content, we have also added social media accounts in Chinese from Twitter and Weibo, web novels, short stories and podcasts with transcription. 3/6 https://t.co/V4P7W7bZhQ

1

0

1

0

88

noecasas retweeted

Langtern @langternapp

over 3 years ago

The Online Content section now has more videos in Chinese from Youtube and now also from Bilibili, organized by category and channel. With an improved popup dictionary for captions! 2/6

1

0

1

0

60

noe casas

@noecasas

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users