Many youngsters are enthusiastically finding vulnerabilities in govt websites.
@IndianCERT should use this opportunity to launch a bug bounty program to incentivise and encourage them. It will build trust and also safeguard Indian infra. Won’t find a better opportunity than this.
Here’s an early signal of what I said
For almost a year now, we at @soketlabs have been working on curating a frontier-scale pretraining data corpus and on finding the best architecture, one that fits the diversity of languages while being compute-optimal for both training and inference.
Sharing one of the many successes we have encountered. Our current version of the arch (yeah, it’s not a clone of DeepSeek or any other known arch) is at least 30% more compute-efficient than DeepSeek’s sparse MoE. These are just initial results and we hope to find a lot more. It shows that we have a lot more to learn about these architectures.
Also excited about the pretraining data we have curated but more on that later
Research efforts take time, but they also yield exponential outcomes, and that’s what most people in India should be building towards
@Thomas_Tao_1 @soketlabs I agree. This was a controlled experiment and these are early results. More ablations are in the pipeline to isolate the impact of data quality.
We at Soket AI are hiring an HPC Infra Engineer for our foundation model efforts. Apply, and do share within your network.
#HiringAlert #FoundationModel #AI #LLM #GPU
https://t.co/WMWz9o3Las
Precisely! Very soon most of their work will be ingested into the model layer and they won’t have much to offer
People also believe that AI will bring new demand for the service sector. That might be true in the immediate future because AI still needs human intervention, but AI will get a lot better and replace the AI service sector itself. Services is not a viable business model anymore. IP and innovation are the only way forward. Invest in R&D.
So we are moving toward Chinese closed-source models now.
Soon the argument that India should just “leverage open source” instead of building its own models will end up where it belongs - in the graveyard.
That’s right. They are treating frontier AI like traditional software.
Many of these large IT services orgs have partnerships with OpenAI and Anthropic, hoping these would save their business model. Now they are realising that these labs are eating their business one segment at a time, and yet they continue to wait and watch
@Hardik_Meisheri That’s so true. Have spoken to many of these leaders. They are excited, but no one wants to wander too far into the unknown. One mistake and they will be blamed for breaking what already works