Pan @E_tailpp - Twitter Profile

E_tailpp retweeted

3 months ago

Free Maxar/Vantor Satellite Imagery for Disaster Response - QGIS and MapLibre Plugins Demo Want to download free high-resolution satellite imagery in QGIS? In this tutorial, I’ll show you how to access and visualize imagery from the Vantor (formerly Maxar) Open Data Program using two powerful plugins, no programming required. The Vantor Open Data Program provides free satellite imagery to support global disaster response and recovery efforts. These plugins make it easy to search, filter, preview, and download imagery directly inside QGIS or in web applications built with MapLibre GL. MapLibre GL Plugin GitHub: https://t.co/kpUPpY9IBt Live Demo: https://t.co/FT4PwKKc8Q QGIS Plugin: GitHub: https://t.co/ECRrscDsS5 Plugin Page: https://t.co/QBHgBFoxyj Vantor Open Data Program: https://t.co/KJhhiwkCcf #QGIS #Maxar #SatelliteImagery #DisasterResponse #OpenSource

3

172

37

181

11K

E_tailpp retweeted

Sharut Gupta @sharut_gupta

4 months ago

[1/n] Do distinct large models admit a simple map that aligns their embedding spaces? We show that across multimodal contrastive models—trained on different data and architectures—an orthogonal map aligns image embeddings. Strikingly, the same map also aligns text embeddings.

12

441

62

364

37K

Pan @E_tailpp

5 months ago

@denisewu @SteveYates because the Lai have bought much offensive weapons

0

4

0

617

E_tailpp retweeted

Lucas Beyer (bl16)

@giffmana

7 months ago

SAM3 seems pretty dope. And then they also released another variant where you can take the segmented thing and turn it into a 3D model!

8

368

18

99

31K

E_tailpp retweeted

Astrid Wilde 🌞

@astridwilde1

7 months ago

Well done to the SAM3D team. I haven't benchmarked it properly yet but this is definitely going to be the default backbone of choice for all 3d tasks by the back half of next year in the literature. What a time to be alive 🌞

37

3K

180

1K

164K

E_tailpp retweeted

Marco Franzon

@mfranz_on

7 months ago

SAM3 for robotics will make your robot recognize everything.

2

117

9

56

7K

Pan @E_tailpp

7 months ago

@DrewPavlou @grok is ture？

0

860

Pan @E_tailpp

8 months ago

@gabriberton @grok explain

1

0

346

Pan @E_tailpp

8 months ago

@Rumoreconomy 人家就没答应和他见面好吧

0

145

0

37K

Pan @E_tailpp

8 months ago

@952994032Augus @wangzhian8848 谁先挑的头？

0

11

0

855

Pan @E_tailpp

8 months ago

@ChinaMacroFacts 谁玩砸了还不知道呢

0

1

0

5K

E_tailpp retweeted

WarTranslated

@wartranslated

9 months ago

An AI-guided FPV drone from Ukraine’s 40th Brigade tracked and hit a Russian boat. A follow-up strike with an Orion loitering munition finished the job.

13

679

69

31

43K

Pan @E_tailpp

10 months ago

@shaneraki @degewa 这个达赖自己都是通过这种方式选来的，他为什么不遵守这套规则

0

1

0

251

Pan @E_tailpp

over 1 year ago

Investigate MIT Media Lab Professor Rosalind Picard for Racist Remarks https://t.co/L8mT7c2EEi 来自 @Change

0

177

E_tailpp retweeted

Marktechpost AI

@Marktechpost

about 2 years ago

LLaVA-NeXT: Advancements in Multimodal Understanding and Video Comprehension Researchers from Nanyang Technological University, University of Wisconsin-Madison, and Bytedance have developed LLaVA-NeXT, a pioneering open-source LMM trained solely on text-image data. The innovative AnyRes technique enhances reasoning, Optical Character Recognition (OCR), and world knowledge, showcasing exceptional performance across various image-based multimodal tasks. Surpassing Gemini-Pro on benchmarks like MMMU and MathVista, LLaVA-NeXT signifies a significant leap in multimodal understanding capabilities. Venturing into video comprehension, LLaVA-NeXT unexpectedly exhibits robust performance, featuring key enhancements. Leveraging AnyRes, it achieves zero-shot video representation, displaying unprecedented modality transfer ability for LMMs. The model’s length generalization capability effectively handles longer videos, surpassing token length constraints through linear scaling techniques. Further, supervised fine-tuning (SFT) and direct preference optimization (DPO) enhance the video understanding prowess. At the same time, efficient deployment via SGLang enables 5x faster inference, facilitating scalable applications like million-level video re-captioning. LLaVA-NeXT’s feats underscore its state-of-the-art performance and versatility across multimodal tasks, rivaling proprietary models like Gemini-Pro on key benchmarks. Quick read: https://t.co/0iglUay0K0 GitHub: https://t.co/0rGaYMMeWm #artificiallyinteligence #ai #artificiallntelligence @liuziwei7

Marktechpost's tweet photo. LLaVA-NeXT: Advancements in Multimodal Understanding and Video Comprehension

Researchers from Nanyang Technological University, University of Wisconsin-Madison, and Bytedance have developed LLaVA-NeXT, a pioneering open-source LMM trained solely on text-image data. The innovative AnyRes technique enhances reasoning, Optical Character Recognition (OCR), and world knowledge, showcasing exceptional performance across various image-based multimodal tasks. Surpassing Gemini-Pro on benchmarks like MMMU and MathVista, LLaVA-NeXT signifies a significant leap in multimodal understanding capabilities.

Venturing into video comprehension, LLaVA-NeXT unexpectedly exhibits robust performance, featuring key enhancements. Leveraging AnyRes, it achieves zero-shot video representation, displaying unprecedented modality transfer ability for LMMs. The model’s length generalization capability effectively handles longer videos, surpassing token length constraints through linear scaling techniques. Further, supervised fine-tuning (SFT) and direct preference optimization (DPO) enhance the video understanding prowess. At the same time, efficient deployment via SGLang enables 5x faster inference, facilitating scalable applications like million-level video re-captioning. LLaVA-NeXT’s feats underscore its state-of-the-art performance and versatility across multimodal tasks, rivaling proprietary models like Gemini-Pro on key benchmarks.

Quick read: https://t.co/0iglUay0K0

GitHub: https://t.co/0rGaYMMeWm

#artificiallyinteligence #ai #artificiallntelligence @liuziwei7

0

22

8

4

2K

E_tailpp retweeted

Awais @iAwaisRauf

about 2 years ago

Last year, I delivered a lecture on large vision-language models at @mbzuai, where I explored some interesting ideas through eight models. The content is a bit old but still relevant. Here are the slides: https://t.co/Arjxl9MTd6