i have never come across anyone who worked at google from the beginning to ~2012 being anything other than a great human being.
google trained people to be non assholes cuz the founders were very much nice dudes & really cared about everyone that worked there.
e.g. they literally handed envelopes of pure cash during christmas to employees.
No value judgment in that by the way, there is a lot of value in thinking deeply and for a long time to understand e.g. sociological phenomena. It just won't lead to high publication output without an external incentive.
CS is hardly lab based, unless you're building robots... My theory about this is that this is a difference between fields where the emphasis is on theories (which you can refine indefinitely) vs experiments (which have to finish at some point).
Research output after tenure drops off a cliff for business, economics, sociology, and other non-lab fields.
But it remains high post-tenure in lab-based fields such as chemistry, physics, computer science, and engineering.
It has not been reported much, but I believe ETH Zurich has, as of last week, banned new Master and PhD students who attended a long list of universities in China, Russia, and Iran. ๐งต
It has not been reported much, but I believe ETH Zurich has, as of last week, banned new Master and PhD students who attended a long list of universities in China, Russia, and Iran. ๐งต
I wonder if a 3D-aware encoding (whether Rope or another approach) in a sequence to expression model could use information from HiC, since we generally don't know the full 3D structure of DNA, nor is it static...
A common misconception about Transformers is to believe that they're a sequence-processing architecture. They're not.
They're a *set-processing* architecture. Transformers are 100% order-agnostic (which was the big innovation compared to RNNs, back in late 2016 -- you compute the full matrix of pairwise token interactions instead of processing one token at a time).
The way you add order awareness in a Transformer is at the *feature* level. You literally add to your token embeddings a position embedding / encoding that corresponds to its place in a sequence. The architecture itself just treats the input tokens as a set.
There's nothing that prevents you from applying the same trick to 2D grid of tokens or a 3D cube of tokens instead! Instead of adding an embedding for (i * N + j), the position of (i, j) in a flattened grid, just embed i and j and concat the result.
Now you have a 2D position embedding that makes your Transformer natively process grids of tokens! This can be useful for ARC-AGI, since ARC-AGI grids are of course 2D grids of discrete tokens, not sequences.
In fact one of the main achievements of LLMs should be too teach us that much of what we admire in successful people (being articulate, formulating convincing arguments, accessing knowledge in their head quickly) is pattern matching rather than reasoning.
Worth repeating:
Do not confuse retrieval with reasoning.
Do not confuse rote learning with understanding.
Do not confuse accumulated knowledge with intelligence.
I won't do as good a job as @NandoDF, but thanks to the @NobelPrize for giving me a decent chance at explaining both the Chemistry and Physics Prize. Computational biologist or polymath, you decide...
For the first time in my life I can explain what the physics @NobelPrize is about!
In fact, if youโd like to learn what is a Hopfield net and how it relates to NP hard satisfiability, Boltzmann machines, autoencoders, score matching, Maxwell demons, maximum likelihood, generative AI, quantum computing, unsupervised learning and neural networks, see these slides and video lectures from a course I taught at @ipam_ucla 2012
https://t.co/4kcR14kSuz
https://t.co/PueyqN9lsn
https://t.co/L523CfjNcg
https://t.co/iURLQM8HRb
For the first time in my life I can explain what the physics @NobelPrize is about!
In fact, if youโd like to learn what is a Hopfield net and how it relates to NP hard satisfiability, Boltzmann machines, autoencoders, score matching, Maxwell demons, maximum likelihood, generative AI, quantum computing, unsupervised learning and neural networks, see these slides and video lectures from a course I taught at @ipam_ucla 2012
https://t.co/4kcR14kSuz
https://t.co/PueyqN9lsn
https://t.co/L523CfjNcg
https://t.co/iURLQM8HRb
I would have used TikTok instead of Tinder as the example, but otherwise pretty spot on. We all used to be somewhere between Wall-E and C3PO for a definition of AI, now the best you can hope for is "not Magic 8 Ball".
@anshulkundaje Agreed - - this is part of my long-running argument that what is holding back deep learning in biology is at least in part the lack of custom architectures that address biological processes.
Nice work. The issue with current seq-> expression models in terms of their very poor performance on variant effect prediction is not that they need to see more variation data. The problem is they don't learn long range effects of enhancers effectively. 1/
We are now hiring into our new Biostatistical Machine Learning theme @MRC_BSU ! We are looking for: (1) a Junior Group Leader in AI/ML for Biomedicine (w/ startup package including an RA) who will help shape the theme; and (2) an RA to join my group. Details below. Please RT :)
We have an open position for a Data Science Innovation Fellow in my group at Novartis Biomedical Research in Oncology! If you are interested in doing AI research with direct impact on oncology drug discovery, please see below. Feel free to reply or DM me. https://t.co/U6Uot3Fzuj
Lior and I wrote a blog post about what it was like for me to find occurrences of duplicated (and seemingly manipulated) data as a first-year graduate student at @Caltech. I am speaking openly because I believe that we, as the scientific community, can do better.