Tom Nicholas @TEGNicholasCode - Twitter Profile

Pinned Tweet

over 1 year ago

At AGU I talked to NASA people about how agencies could better support open-source tools they rely on. I argued that our recent collaboration between Xarray and NASA ESDIS on xarray.DataTree was a good model to copy - read about how it happened here! https://t.co/uTJ6VFLjHD

1

25

4

6

2K

TEGNicholasCode retweeted

Beto @betolink

over 1 year ago

Science needs a social network for sharing big data https://t.co/rOUaN9Us60 by @TEGNicholasCode

0

4

1

244

TEGNicholasCode retweeted

Pangeo @pangeo_data

over 1 year ago

We're moving over to BlueSky and LinkedIn for all our future announcements. Follow us at https://t.co/CfejMIiPuH to find out more about tomorrow's showcase 😉 (p.s., it's on Xpublish at Scale at 4 PM EST 🚀) Connect with us on LinkedIn at https://t.co/b0duHratPH

0

3

0

1K

TEGNicholasCode retweeted

Xarray @xarray_dev

over 1 year ago

Our friend's over at @zarr_dev made a big release today! Xarray v2025.01.1 was also released today with full support for Zarr-Python 3 🚀

0

25

8

4

4K

Who to follow

Ryan Abernathey

@rabernat

Scientist and Startup Founder Co-founder and CEO @EarthmoverHQ @pangeo_data steering council member ex-Professor @columbia @lamontEarth

Xarray

@xarray_dev

N-D labeled arrays and datasets in Python

Earthmover

@EarthmoverHQ

the cloud platform for scientific data teams. Come find us: https://t.co/pQcSbgRbw5

TEGNicholasCode retweeted

Earthmover @EarthmoverHQ

over 1 year ago

🌤️ #AMS2025 is just around the corner! We are taking AMS by storm with an exhibitor booth (booth 353), two talks from @_jhamman and @rabernat , and hosting a @pangeo_data Community Happy Hour (register here: https://t.co/7CujhX87AC)!

EarthmoverHQ's tweet photo. 🌤️ #AMS2025 is just around the corner! We are taking AMS by storm with an exhibitor booth (booth 353), two talks from @_jhamman and @rabernat , and hosting a @pangeo_data Community Happy Hour (register here: https://t.co/7CujhX87AC)! https://t.co/aBrh464n3r

1

11

6

1

1K

Tom Nicholas @TEGNicholasCode

over 1 year ago

@rogercreel I'm there too! https://t.co/63ivZJ4MVa

0

68

Tom Nicholas @TEGNicholasCode

over 1 year ago

At AGU I talked to NASA people about how agencies could better support open-source tools they rely on. I argued that our recent collaboration between Xarray and NASA ESDIS on xarray.DataTree was a good model to copy - read about how it happened here! https://t.co/uTJ6VFLjHD

1

25

4

6

2K

Tom Nicholas @TEGNicholasCode

over 1 year ago

@ladino_123 Thanks for your help @ladino_123 !

0

2

0

75

Tom Nicholas @TEGNicholasCode

over 1 year ago

@alekpetty I'm hoping that virtual zarr datasets will make it easier to cloud-optimize data that was dumped in a bucket in a legacy format, and allow creating aggregated datasets with relevant derived information alongside it. https://t.co/x39eXZ82Zf

1

0

75

Tom Nicholas @TEGNicholasCode

over 1 year ago

Completely agree - "in theory" we have the simple scalability of the cloud, but in practice it's often a headache, for no good reason, which prevents adoption by most users (including many scientists)

Matthew Rocklin @mrocklin

over 1 year ago

New Post: Cloud Computing is Broken https://t.co/Ode3eXkGFO Investor asks: "What's next for Data/Cloud Infrastructure?" My answer: "Boring stuff. People struggle with basics." Cloud feels like MP3 players before iPod. In theory everything is good. In practice adoption is low

1

22

3

6

2K

1

5

0

2

304

Tom Nicholas @TEGNicholasCode

over 1 year ago

@alekpetty Makes total sense. On (1) and (2) some intermediate services (e.g. Coiled, Modal) would like to sell you the solution to this, but it's annoying that NASA + AWS can't just get it right first time On (3) - is your data in the cloud at least? If not in cloud-optimized format?

1

0

65

TEGNicholasCode retweeted

Ian Schuler @ianschuler

over 1 year ago

@mouthofmorrison @rabernat @betolink @EarthmoverHQ @steadyflux That said, it isn't 100% clear that NASA's best move is to immediately convert 10000+ data sets into cutting edge ARCO formats. Kerchunk and Virtual Zarr offer benefits of ARCO while keeping data in the native formats.

1

11

2

3

3K

Tom Nicholas @TEGNicholasCode

over 1 year ago

@betolink @mouthofmorrison @rabernat @EarthmoverHQ It doesn't need to be duplicated if you use the VirtualiZarr/Kerchunk approach...

0

2

0

112

Tom Nicholas @TEGNicholasCode

over 1 year ago

I'll also be there if you want to join me working on @xarray_dev , DataTree, or VirtualiZarr!

Joe Hamman @_jhamman

over 1 year ago

Are you heading to #AGU24 next month? Consider joining us for a bonus day of hacking on @pangeo_data. I'll be there representing @EarthmoverHQ and helping folks work with #icechunk and @zarr_dev. Details and signup here: https://t.co/kgUuokUo3k

0

14

1

0

1K

0

5

1

0

335

TEGNicholasCode retweeted

Deepak Cherian @cherian_deepak

over 1 year ago

Come learn about recent @xarray_dev GroupBy improvements at tomorrow's (Wed, Nov 13) Pangeo Showcase! https://t.co/K0Wi0ZQK43

cherian_deepak's tweet photo. Come learn about recent @xarray_dev GroupBy improvements at tomorrow's (Wed, Nov 13) Pangeo Showcase!

https://t.co/K0Wi0ZQK43 https://t.co/SfHhV23ec6

1

28

7

2

2K

Tom Nicholas @TEGNicholasCode

over 1 year ago

@_JacobTomlinson @pydatanyc Oh I didn't know about this - I could have gone easily!

1

0

100

TEGNicholasCode retweeted

Joe Hamman @_jhamman

over 1 year ago

We've talked a lot about #Icechunk's performance this week 🚀. But the Zarr-Python 3 results are also very encouraging! We're a few weeks away from the 3.0 launch but what this chart shows is that the new AsyncIO + multi-threading functionality in Zarr is going to be really good.

0

8

1

0

626

Tom Nicholas @TEGNicholasCode

over 1 year ago

All these integrations represent literally years-worth of effort, all coming out at once 🤯 And that's not even mentioning all the other changes you see in a typical xarray release!

1

9

0

288

Tom Nicholas @TEGNicholasCode

over 1 year ago

Xarray v2024.10.0 has just been released, including support for xarray.DataTree and zarr-python v3 !!! https://t.co/LB2cvjuJKx @xarray_dev @zarr_dev

2

95

26

16

13K

Tom Nicholas @TEGNicholasCode

over 1 year ago

ALSO this release is the first to be compatible with the much anticipated v3 implementation of zarr-python! (still on its beta branch right now) This brings big performance benefits when reading @zarr_dev on S3 via async and (b) compatibility with @EarthmoverHQ 's Icechunk.

TEGNicholasCode's tweet photo. ALSO this release is the first to be compatible with the much anticipated v3 implementation of zarr-python! (still on its beta branch right now)

This brings big performance benefits when reading @zarr_dev on S3 via async and (b) compatibility with @EarthmoverHQ 's Icechunk. https://t.co/Y0m4f7MuWe

1

6

0

948

Tom Nicholas

@TEGNicholasCode

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users