Yuhang He (Henry) @henryoxplore - Twitter Profile

Pinned Tweet

about 2 months ago

I am honored to introduce my new work, XShapeEnc: Training-free Spatially-Grounded Geometry Shape Encoding. This innovative approach is capable of encoding an arbitrary spatially-grounded 2D geometric shape into high-dimensional space without any training. Key features include: - Utilization of the classic Zernike basis for efficient encoding of shape geometry and shape pose, either jointly or separately. - An interpretable and invertible encoding process that is rich in high-frequency details. XShapeEnc offers wide applicability for various downstream tasks that require shape analysis. For more details: paper: https://t.co/qcLa6zgtEj code: https://t.co/eFVBxTyXzI

HenryOxplore's tweet photo. I am honored to introduce my new work, XShapeEnc: Training-free Spatially-Grounded Geometry Shape Encoding. This innovative approach is capable of encoding an arbitrary spatially-grounded 2D geometric shape into high-dimensional space without any training.
Key features include:
- Utilization of the classic Zernike basis for efficient encoding of shape geometry and shape pose, either jointly or separately.
- An interpretable and invertible encoding process that is rich in high-frequency details.
XShapeEnc offers wide applicability for various downstream tasks that require shape analysis.
For more details:
paper: https://t.co/qcLa6zgtEj
code: https://t.co/eFVBxTyXzI

0

2

0

176

Yuhang He (Henry)

@HenryOxplore

about 2 months ago

Honored to see my work has been featured by this blog. Deep Neural Networks: From Trustworthy Explanations to Robust Autonomous Systems https://t.co/xlJuItHhIM

0

82

Yuhang He (Henry)

@HenryOxplore

4 months ago

🚀 [ICLR 2026] Existing text-to-audio generation (TTA) methods mainly focus on semantic correctness, yet they perform very poorly on relation-aware TTA generation. For example, current models achieve <30% audio event presence accuracy and <10% relation accuracy. In our newly accepted ICLR 2026 paper, we introduce Aurelius, a framework that enables relation-aware TTA research at scale. Specifically, we introduce two meticulously curated corpora: 🗂 AudioEventSet — 110 audio events across 7 major classes. 🗂 AudioRelSet — 100 relations across 6 major relation types. Based on the two corpora and the proposed data creation strategy, we can create massive (nearly unlimited) <text, audio> pairs with both • high linguistic diversity. • high acoustic diversity. We release all resources to support the broader community in AI, acoustics, computer vision, and multimodal research. 📄 Paper: https://t.co/ZzDiCo5gJm 🗂 Dataset: https://t.co/Qe9fc2kPKW 💻 Code: https://t.co/52G6h6pqPP 🌐 Project Page: https://t.co/zI1gyP2yko Huge thanks to Andrew Markham, He Liang, @_jainyash and @VibhavVineet at Microsoft Research and University of Oxford for their unwavering support. #ICLR2026 #Multimodality

HenryOxplore's tweet photo. 🚀 [ICLR 2026] Existing text-to-audio generation (TTA) methods mainly focus on semantic correctness, yet they perform very poorly on relation-aware TTA generation. For example, current models achieve <30% audio event presence accuracy and <10% relation accuracy.

In our newly accepted ICLR 2026 paper, we introduce Aurelius, a framework that enables relation-aware TTA research at scale. Specifically, we introduce two meticulously curated corpora:

🗂 AudioEventSet — 110 audio events across 7 major classes.

🗂 AudioRelSet — 100 relations across 6 major relation types.

Based on the two corpora and the proposed data creation strategy, we can create massive (nearly unlimited) <text, audio> pairs with both

• high linguistic diversity.

• high acoustic diversity.

We release all resources to support the broader community in AI, acoustics, computer vision, and multimodal research.

📄 Paper: https://t.co/ZzDiCo5gJm

🗂 Dataset: https://t.co/Qe9fc2kPKW

💻 Code: https://t.co/52G6h6pqPP

🌐 Project Page: https://t.co/zI1gyP2yko

Huge thanks to Andrew Markham, He Liang, @_jainyash and @VibhavVineet at Microsoft Research and University of Oxford for their unwavering support.

#ICLR2026 #Multimodality

0

3

0

681

HenryOxplore retweeted

Jing Wu 🍄 @jingwu23

4 months ago

📣📣 Introducing OS-Marathon⏱️ I am proud to share this interesting project I did during my internship @Microsoft @MSFTResearch. 🥳 This work is about benchmarking computer-use agents' ability on a type of long-horizon, repetitive tasks. ✅ We focus on desktop workflows that are long-horizon and also repetitive, e.g. filling the expense system given a set of various travelling receipts. ✅ We create data, build tasks and environments and categorise them into multiple difficulty levels to enable fine-grained evaluation. 📎Project Page: https://t.co/NORAd1ptuL 📄Paper: https://t.co/UxERGlevQb Joint work with Daphne Barretto, Yiye Chen, Nicholas Gydé, Yanan Jian, Yuhang He @HenryOxplore, Vibhav Vineet @VibhavVineet

jingwu23's tweet photo. 📣📣 Introducing OS-Marathon⏱️

I am proud to share this interesting project I did during my internship @Microsoft @MSFTResearch. 🥳

This work is about benchmarking computer-use agents' ability on a type of long-horizon, repetitive tasks.

✅ We focus on desktop workflows that are long-horizon and also repetitive, e.g. filling the expense system given a set of various travelling receipts.
✅ We create data, build tasks and environments and categorise them into multiple difficulty levels to enable fine-grained evaluation.

📎Project Page: https://t.co/NORAd1ptuL
📄Paper: https://t.co/UxERGlevQb

Joint work with Daphne Barretto, Yiye Chen, Nicholas Gydé, Yanan Jian, Yuhang He @HenryOxplore, Vibhav Vineet @VibhavVineet

0

1

0

137

Who to follow

Deep Learning | Speech | Neuroscience | Mobile @[email protected]

7 months ago

ONE QUESTION Why doesn't this statement explain why this happened?

ICLR @iclr_conf

7 months ago

iclr_conf's tweet photo. https://t.co/Xqd8AEERb0

52

679

138

116

1M

0

202

Yuhang He (Henry)

@HenryOxplore

7 months ago

MSR Vancouver lab is looking for a Canada based Ph.D. intern that can start the internship asap (preferrably before Christmas). The internship is about LLM pre/post train. The candidate should hold a work permit already. DM me if anyone is interested.

0

95

Yuhang He (Henry)

@HenryOxplore

9 months ago

Our lab at MSR is also looking for multimodal learning intern in both Canada and US. Please apply via this link: https://t.co/BJKpN2uUx1

0

2

0

213

Yuhang He (Henry)

@HenryOxplore

9 months ago

Microsoft Research Vancouver lab is hiring multiple interns in Canada working at AI-Driven System Design. The successful applicant is expected to start interning as early as possible. Welcome to share to whoever might be interested in. Apply through: https://t.co/OALz8l77Pb

0

3

0

1

202

HenryOxplore retweeted

WiML @WiMLworkshop

12 months ago

Microsoft’s roundtable: Industrial-academic collaboration #WiML #ICML2025

1

7

1

0

616

HenryOxplore retweeted

Kosta Derpanis (sabbatical in Zurich)

@CSProfKGD

over 1 year ago

Next stop arXiv cleaner https://t.co/bU3xlOqR4h

3

308

37

183

21K

HenryOxplore retweeted

Nikunj Kothari

@nikunj

over 1 year ago

You can just do (hard) things..

64

8K

1K

6K

598K

Yuhang He (Henry)

@HenryOxplore

over 1 year ago

Existing text-to-audio generation models fail to model audio events relations. We fill in this gap with a new benchmark, evaluation protocol. Title: RiTTA: Modeling Event Relations in Text-to-Audio Generation. Project site: https://t.co/ehCpuMP7Bf Code: https://t.co/PnDdUQ5hmu

1

0

148

Yuhang He (Henry)

@HenryOxplore

almost 2 years ago

@leppert Hi Greg, for the job, do you hire postdoc?

0

182

Yuhang He (Henry)

@HenryOxplore

almost 2 years ago

On my way to #ICML2024 , look forward to catch up with new friends. I am also in the job market, look for research job in multimodal audio-visual-x learning.

1

0

419

Yuhang He (Henry)

@HenryOxplore

almost 2 years ago

Honored to introduce our new work: SPEAR, receiver-to-receiver neural warping field to predict spatial acoustic effects for one position from another reference position, without requiring source position. Code: https://t.co/PvfFnh5PfM. Paper: https://t.co/jqVawL6Tdw

0

1

0

346

Yuhang He (Henry)

@HenryOxplore

about 2 years ago

Happy to share that I have started an internship at Microsoft applied science group in Munich Germany.

0

1

0

219

Yuhang He (Henry)

@HenryOxplore

about 2 years ago

Happy to share the news that I have had one paper on spatial audio learning accepted by #ICML2024 !

0

3

0

227

HenryOxplore retweeted

Michael Black

@Michael_J_Black

about 2 years ago

Build what you need and use what you build. This is a core philosophy of my research. It shifts the focus away from publishing “papers” to what really matters — impact. This thread unpacks why I think this is a successful approach to science. 1/10 Or see: https://t.co/p3iWJ9LCzf

16

1K

258

756

158K

Yuhang He (Henry)

@HenryOxplore

about 2 years ago

@tfburns Thanks Tom, make sense to me. Lots of work to do on the author side then.

0

15

Yuhang He (Henry)

@HenryOxplore

over 2 years ago

I was invited to be an emergency reviewer for #icml . I accepted and tried my best to give the review. After seeing the released reviews, I suddenly found up to six reviews are here for that paper. Am I not an emergency reviewer at all then?🤣

1

0

1K

Yuhang He (Henry)

@HenryOxplore

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users