Abhishek Shivkumar @abhisemweb - Twitter Profile

abhisemweb retweeted

Polymarket Money

@PolymarketMoney

about 1 month ago

Trump says $IBM is “going to go up a lot more.”

89

2K

146

297

817K

abhisemweb retweeted

Jean P.D. Meijer ― 🇪🇺 eu/acc

@initjean

about 1 month ago

Half of all Claude Code subscriptions could be completely wiped out in the next 6-12 months, predicts Anthropic CEO Dario Amodei.

initjean's tweet photo. Half of all Claude Code subscriptions could be completely wiped out in the next 6-12 months, predicts Anthropic CEO Dario Amodei. https://t.co/Zig2lAFKnT

146

7K

293

801

2M

Abhishek Shivkumar @abhisemweb

over 1 year ago

We are proud to sponsor LSX World Congress, taking place on 28-30 April 2025. Join us to access strategic knowledge and form new partnerships. #LSXWorld https://t.co/RPv9I6iwxa

0

20

Abhishek Shivkumar @abhisemweb

almost 3 years ago

“Finetuning excels in its ability to adapt an LLM’s behaviour to specific nuances, tones, or terminologies. If we want the model to sound more like a medical professional, write in a poetic style, or use the jargon of a specific industry…” — @HeikoHotz https://t.co/UjhMBTPsyl

0

1

0

95

Who to follow

Nyereka Tech

@nyerekatech

Best quality of #education accessable to all. #IoT, #STEM, #Robotic and #Micontroller experimental learning materials 📞(+250)-780-690-502

John P

@jplehmann

Founded @MorphMarket (exited '23) | AI/NLP since '01 | Independent Investor

François Scharffe

@lechatpito

Data Chef. [email protected]

abhisemweb retweeted

Santiago

@svpino

about 3 years ago

How can you solve complex tasks using a Large Language Model? Here is a 2-minute introduction to everything you need to know to 10x the quality of your results. Let's talk about three techniques, in order of complexity, starting with the easiest one: • In-Context Learning • Indexing + In-Context Learning • Fine-tuning In-Context Learning The team that trained GPT-3 found something they couldn't explain: You can condition a model using examples of how you want it to behave. I included an example prompt in the attached video. You can "teach" the model how you want it to interpret questions, select the correct answers, and format the results by giving a few examples. You can also give specific knowledge to the model that will be helpful when formulating answers. We call this approach "grounding the model." There's another example in the video. Indexing + In-Context Learning Unfortunately, there is a limit to how much data you can include in a prompt. We call this the "context size." One version of GPT-4 supports a context of approximately 6,000 words, while the other supports 25,000 words. Although this sounds like a lot, many applications need more than that. Imagine you wrote a book and want to build an application to answer any questions about your story. What happens if your book is longer than the context? That's where Indexing comes in. Using a model, you can turn every book passage into an embedding. These are vectors, numbers that "encode" the passage's text. You can then store these embeddings in a particular database that supports fast retrieval of these vectors. You can then turn any question into an embedding and search the database for the list of passages that are similar to that query. Instead of using the entire book to ask the model, you can now use the relevant passages as in-context information, effectively working around the context size limitation. Fine-tuning Fine-tuning can give you an extra boost to get reliable outputs from your LLM. It is, however, the most complex approach on the list. There are different approaches to fine-tuning a model with your data. A popular technique is to process your data with your LLM and use the outputs to train a new classifier that solves your specific task. Notice that here you aren't modifying the LLM. Instead, you are chaining it with your trained classifier. Another approach is to modify the parameters of the LLM using your data. Think of this as "rewiring" the model in a way that solves your particular task. The results and costs will vary depending on how many layers you want to fine-tune from the original model. Many companies think that fine-tuning is the solution to their problems. In my experience, many will benefit from exploring the other two approaches. I love explaining Machine Learning and Artificial Intelligence ideas. If you enjoy in-depth content like this, follow me @svpino so you don't miss what comes next.

61

1K

187

2K

384K

abhisemweb retweeted

Jay Alammar

@JayAlammar

almost 6 years ago

How GPT3 works. A visual thread. A trained language model generates text. We can optionally pass it some text as input, which influences its output. The output is generated from what the model "learned" during its training period where it scanned vast amounts of text. 1/n

35

2K

731

835

0

abhisemweb retweeted

elvis

@omarsar0

about 3 years ago

Open-source ML is at it again! Databricks just released Dolly 2.0! Here's what you need to know: - This model is a 12B parameter language model based on EleutherAI Pythia model family. - It's fine-tuned on 15K high-quality human-generated prompt/response pairs (crowdsourced among Databricks employees) for instruction tuning LLMs. - Dolly 2.0 is open-sourced, including training code, dataset, and model weights. - The best part is that it's suitable for commercial use! This is one of the big limitations of previous instruction-following models like Alpaca, Koala, GPT4All, and Vicuna. Model weights: https://t.co/yUj5XKCdVU Dataset: https://t.co/DgI70balUp Blog: https://t.co/9tPgynbkR7

17

914

184

584

220K

Abhishek Shivkumar @abhisemweb

over 3 years ago

@Lexy_Hodge back up now

0

21

abhisemweb retweeted

Ben Tossell

@bentossell

over 3 years ago

The best threads on what’s happening in AI:

30

729

117

954

0

abhisemweb retweeted

Afiz ⚡️

@itsafiz

over 3 years ago

Study Deep Learning for Free from MIT MIT's introductory course on deep learning methods with applications in computer vision, language, and more! Course Link: https://t.co/6sMlJS1Pc8

itsafiz's tweet photo. Study Deep Learning for Free from MIT

MIT's introductory course on deep learning methods with applications in computer vision, language, and more!

Course Link: https://t.co/6sMlJS1Pc8 https://t.co/685oH2gVgR

9

541

146

437

0

abhisemweb retweeted

Mark Tenenholtz

@marktenenholtz

over 3 years ago

Computer vision is coming back into the forefront with Stable Diffusion. But if you're totally new to CV, you've gotta get started somewhere. No matter your skill level, here’s my favorite computer vision course. (And, of course, it’s 100% free from University of Michigan!)

marktenenholtz's tweet photo. Computer vision is coming back into the forefront with Stable Diffusion.

But if you're totally new to CV, you've gotta get started somewhere.

No matter your skill level, here’s my favorite computer vision course.

(And, of course, it’s 100% free from University of Michigan!) https://t.co/1gkFOWEjD3

10

748

145

647

0

abhisemweb retweeted

Mastodon (@[email protected])

@joinmastodon

over 3 years ago

Mastodon has just passed over 2 million active monthly users, a new record! People are voting with their feet. The future of social media doesn't have to belong to a billionaire, it can be in the hands of its users.

1K

35K

6K

597

0

Abhishek Shivkumar @abhisemweb

over 3 years ago

A nice article on #mlops, #datadrift and other concepts - https://t.co/8p7FUPsubi

0

1

0

abhisemweb retweeted

Sharif Shameem

@sharifshameem

over 3 years ago

A few more photorealistic samples from the new Lexica model.

66

1K

103

235

0

Abhishek Shivkumar @abhisemweb

over 3 years ago

@Scobleizer @Scobleizer , your lists are awesomely curated. Thanks for that. Saved a ton of time.

0

1

0

abhisemweb retweeted

𝙷𝚒𝚖𝚊 𝙻𝚊𝚔𝚔𝚊𝚛𝚊𝚓𝚞

@hima_lakkaraju

over 3 years ago

Excited to share that my day-long workshop (a short course) on #ExplainableAI is now publicly available as a five-part youtube video lecture series. Link to video lectures: https://t.co/n0pKBbGByw Link to slides: https://t.co/UzUQeCXUox #AI #ML @trustworthy_ml @XAI_Research

hima_lakkaraju's tweet photo. Excited to share that my day-long workshop (a short course) on #ExplainableAI is now publicly available as a five-part youtube video lecture series.

Link to video lectures: https://t.co/n0pKBbGByw
Link to slides: https://t.co/UzUQeCXUox

#AI #ML @trustworthy_ml @XAI_Research https://t.co/OhUSib9FLM

22

2K

415

1K

0

abhisemweb retweeted

Michael Kennedy @mkennedy

over 3 years ago

Excited about #Python 3.11? Me too! I made a video to give you the major highlights! Python 3.11 in 100 seconds: https://t.co/chwcgrvL9U

3

95

21

19

0

Abhishek Shivkumar @abhisemweb

over 4 years ago

Really? Human level performance?

0

abhisemweb retweeted

Harsh Makadia

@MakadiaHarsh

over 4 years ago

Most movies are boring. But the right ones will blow your mind. Here is a list of 10 movies that a programmer cannot miss:

107

3K

684

2K

0

Abhishek Shivkumar @abhisemweb

over 4 years ago

@VueHelp Folks, the movie got over long back ... Much before you replied. I was not able to connect to a human on your support line because your speech recognition engine didn't recognize the movie name I was saying. I remember repeating atleast 30 times.

1

0

Abhishek Shivkumar @abhisemweb

over 4 years ago

@vuecinemas trying to contact you and your speech to text is completely failing. What tech do you use ?

1

0

Abhishek Shivkumar

@abhisemweb

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users