Subham De @SubhamDe2021 - Twitter Profile

Subham De @SubhamDe2021

25 days ago

RT @antgoldbloom: I've spent a good chunk of the last two years seeing how different companies approach account scoring. I've noticed that…

0

1

0

25

SubhamDe2021 retweeted

Anthony Goldbloom @antgoldbloom

over 1 year ago

Looking at job posts over the last year, it looks like Amazon is making the broadest AI investment. 228 teams across the company posted jobs with GenAI projects over the past year, considerably more than any other company. https://t.co/aOYjKfD6OJ Most interest is the breadth of teams and use cases. They made posts from the core research teams, Alexa, customer service, seller experience, fulfillment, and catalog selection teams (full list: https://t.co/1UnE6iY9IE). Some examples: - Creative X Team: generating high-quality text, images and video for advertisers (https://t.co/ymi1eMXrba) - Geospatial Science Team: for global address parsing and validation (https://t.co/UtSm8AX6DD) - Selection and Catalog Systems Team: improve the completeness and correctness of product data for Amazon shoppers (https://t.co/3Ll3SYFYli)

antgoldbloom's tweet photo. Looking at job posts over the last year, it looks like Amazon is making the broadest AI investment. 228 teams across the company posted jobs with GenAI projects over the past year, considerably more than any other company. https://t.co/aOYjKfD6OJ

Most interest is the breadth of teams and use cases. They made posts from the core research teams, Alexa, customer service, seller experience, fulfillment, and catalog selection teams (full list: https://t.co/1UnE6iY9IE).

Some examples:
- Creative X Team: generating high-quality text, images and video for advertisers (https://t.co/ymi1eMXrba)
- Geospatial Science Team: for global address parsing and validation (https://t.co/UtSm8AX6DD)
- Selection and Catalog Systems Team: improve the completeness and correctness of product data for Amazon shoppers (https://t.co/3Ll3SYFYli)

1

47

16

24

17K

SubhamDe2021 retweeted

Anthony Goldbloom @antgoldbloom

over 1 year ago

The overlooked GenAI use case: cleaning, processing, and analyzing data. https://t.co/j9hFXWJvvF Job post data tell us what companies plan to do with GenAI. The most common use case is data analytics projects. Examples: - AstraZeneca: using LLMs on freeform documents to structure results from their Extractables & Leachables testing (https://t.co/wbamJnQEEN) - Trafigura: The Document AI team is using LLMs to extract data from a corpus of commodity trading documents to generate credit reports (https://t.co/mujn0ksa90) The startup ecosystem is overlooking this use case, instead focusing on other areas such as customer support, sales & marketing and code gen.

antgoldbloom's tweet photo. The overlooked GenAI use case: cleaning, processing, and analyzing data.
https://t.co/j9hFXWJvvF

Job post data tell us what companies plan to do with GenAI. The most common use case is data analytics projects. Examples:

- AstraZeneca: using LLMs on freeform documents to structure results from their Extractables & Leachables testing (https://t.co/wbamJnQEEN)

- Trafigura: The Document AI team is using LLMs to extract data from a corpus of commodity trading documents to generate credit reports
(https://t.co/mujn0ksa90)

The startup ecosystem is overlooking this use case, instead focusing on other areas such as customer support, sales & marketing and code gen.

22

641

112

795

165K

Subham De @SubhamDe2021

almost 2 years ago

Apply using the links shared below, or if you know talented engineers or data people who could be a great fit, I would appreciate referrals. https://t.co/K1LNzIekq3 https://t.co/SfvlZquPVO 📷

0

142

Who to follow

Ram Bharadwaj

@arbdwj

AI safety research fellow @lasrlabs. Prev @lossfunk

AIadvocate

@aiadvocateorg

🤖💼 AiAdvocate: Connecting cutting-edge AI startups with future-focused clients for seamless growth & innovation. Your success, our mission! #AI ✨

Big Y

@smalll_y

reloadsol and Givingback Ninja

Subham De @SubhamDe2021

almost 2 years ago

We're hiring several backend/data/ML engineers for our new(ish) company Sumble, which is focused on building high-quality structured data from raw, noisy inputs.

2

6

0

1

243

Subham De @SubhamDe2021

almost 2 years ago

We have a founding engineering team of 7, which we built over the past year. We have funding, revenue, and users, and we just hit milestones that make us want to move faster.

1

0

145

Subham De @SubhamDe2021

over 2 years ago

@khandelia1000 base model, not the instruct one.

1

0

61

Subham De @SubhamDe2021

over 2 years ago

I've seen many reports of people struggling to finetune Gemma. We are using Alpaca-style instruction formats over conversational style and we are now getting superior performance out of Gemma-2b compared with Mistral-7b for our NER task (see screenshot to see our task). The approach was inspired by @maximelabonne https://t.co/E3hJh0alLO

SubhamDe2021's tweet photo. I've seen many reports of people struggling to finetune Gemma. We are using Alpaca-style instruction formats over conversational style and we are now getting superior performance out of Gemma-2b compared with Mistral-7b for our NER task (see screenshot to see our task).

The approach was inspired by @maximelabonne https://t.co/E3hJh0alLO

Maxime Labonne

@maximelabonne

over 2 years ago

💎 Gemma + Alpaca > chat model The chat model (gemma-2b-it) looks pretty bad, so I ran a quick experiment and retrained gemma-2b on the Alpaca dataset. It's only 52k samples but it already shows better performance on Nous' benchmark suite. Looks quite promising for the Gemma models. 🤗 Model: https://t.co/96I7V8zoCU 📊 Leaderboard: https://t.co/GnqMupLIQR

maximelabonne's tweet photo. 💎 Gemma + Alpaca > chat model

The chat model (gemma-2b-it) looks pretty bad, so I ran a quick experiment and retrained gemma-2b on the Alpaca dataset.

It's only 52k samples but it already shows better performance on Nous' benchmark suite. Looks quite promising for the Gemma models.

🤗 Model: https://t.co/96I7V8zoCU
📊 Leaderboard: https://t.co/GnqMupLIQR

5

121

19

40

17K

6

128

22

159

63K

Subham De @SubhamDe2021

over 2 years ago

@romechenko @karpathy @Yampeleg we have had success with it here for a NER task: https://t.co/3ODVdG6jLY

Subham De @SubhamDe2021

over 2 years ago

I've seen many reports of people struggling to finetune Gemma. We are using Alpaca-style instruction formats over conversational style and we are now getting superior performance out of Gemma-2b compared with Mistral-7b for our NER task (see screenshot to see our task). The approach was inspired by @maximelabonne https://t.co/E3hJh0alLO

6

128

22

159

63K

0

44

Subham De @SubhamDe2021

over 2 years ago

@cclark @antgoldbloom A mix of manual labels + GPT-4 labels

0

1

0

46

Subham De @SubhamDe2021

over 2 years ago

@ddhuan88 @antgoldbloom vLLM

0

783

Subham De @SubhamDe2021

over 2 years ago

@EkanshVerma12 We are using a set-up similar to UniNER project (https://t.co/gTwGuQ9wEd)

0

7

0

8

590

Subham De @SubhamDe2021

over 2 years ago

@shreydan We started with an encoder-only model, Longformer. But we get better performance with LLMs.

0

6

0

1

790

Subham De @SubhamDe2021

over 2 years ago

A conversation style tuning format as adopted in UniNER(https://t.co/X6TrXcJnQG) project seems to confuse Gemma. We use an instruction format instead as shown in the attached screenshot below.

SubhamDe2021's tweet photo. A conversation style tuning format as adopted in UniNER(https://t.co/X6TrXcJnQG) project seems to confuse Gemma. We use an instruction format instead as shown in the attached screenshot below. https://t.co/c7NW0o9Ddk

1

3

0

3

1K

Subham De @SubhamDe2021

over 2 years ago

We assume inference cost improvements are because of the smaller model size. And assume increased performance on Asian languages are due to Gemma’s vast vocabulary which leads to fewer tokens for Asian languages(Gemma: 158 vs. Mistral: 272 for Japanese text in previous screenshot). Denser inputs and better representation in the transformer as pointed out by @karpathy ( https://t.co/IOai7MiQbH)

Andrej Karpathy

@karpathy

over 2 years ago

New (2h13m 😅) lecture: "Let's build the GPT Tokenizer" Tokenizers are a completely separate stage of the LLM pipeline: they have their own training set, training algorithm (Byte Pair Encoding), and after training implement two functions: encode() from strings to tokens, and decode() back from tokens to strings. In this lecture we build from scratch the Tokenizer used in the GPT series from OpenAI.

karpathy's tweet photo. New (2h13m 😅) lecture: "Let's build the GPT Tokenizer"

Tokenizers are a completely separate stage of the LLM pipeline: they have their own training set, training algorithm (Byte Pair Encoding), and after training implement two functions: encode() from strings to tokens, and decode() back from tokens to strings. In this lecture we build from scratch the Tokenizer used in the GPT series from OpenAI.

350

14K

2K

7K

2M

1

6

0

2K

Subham De @SubhamDe2021

over 3 years ago

Google's #ChatGPT

Sundar Pichai

@sundarpichai

over 3 years ago

1/ In 2021, we shared next-gen language + conversation capabilities powered by our Language Model for Dialogue Applications (LaMDA). Coming soon: Bard, a new experimental conversational #GoogleAI service powered by LaMDA. https://t.co/cYo6iYdmQ1

659

14K

3K

1K

7M

1

0

459

SubhamDe2021 retweeted

andrew gao

@itsandrewgao

over 3 years ago

What if you could talk to the Bible? Try now: https://t.co/sL7dtiyneD Describe your situation or ask a question! Cites specific verses from the Bible. Works in English and Spanish. Made with @OpenAI and domain via @Porkbun! #biblegpt #chatgpt #gpt3

itsandrewgao's tweet photo. What if you could talk to the Bible?

Try now: https://t.co/sL7dtiyneD

Describe your situation or ask a question! Cites specific verses from the Bible. Works in English and Spanish.

Made with @OpenAI and domain via @Porkbun!
#biblegpt #chatgpt #gpt3 https://t.co/sykRsIVtu5

140

2K

211

726

651K

Subham De

@SubhamDe2021

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users