Earther

@EartherAI

Researching Learning Systems | Deep Learning × Vision Mamba x Fusion Models

India

Joined January 2025

4.2K Following

361 Followers

489 Posts

Pinned Tweet

Earther @EartherAI

6 months ago

EartherAI's tweet photo. https://t.co/9W98kpkgHk

Earther @EartherAI

6 months ago

Day 57 of #100DaysOfML Today was about generative classifiers—specifically Naive Bayes. Key takeaways: Modeling P(x∣y)P(x \mid y)P(x∣y) instead of P(y∣x)P(y \mid x)P(y∣x) changes how you think about learning Class-conditional independence → simple likelihoods, strong baselines MLE for Bernoulli & Gaussian NB is surprisingly intuitive With equal covariance, Gaussian NB yields a linear decision boundary Zero-frequency issues → Laplace smoothing is not optional Simple assumptions. Strong theory. Still competitive in practice.

$EartherAI's tweet photo. Day 57 of #100DaysOfML Today was about generative classifiers—specifically Naive Bayes. Key takeaways: Modeling P(x∣y)P(x \mid y)P(x∣y) instead of P(y∣x)P(y \mid x)P(y∣x) changes how you think about learning Class-conditional independence → simple likelihoods, strong baselines MLE for Bernoulli & Gaussian NB is surprisingly intuitive With equal covariance, Gaussian NB yields a linear decision boundary Zero-frequency issues → Laplace smoothing is not optional Simple assumptions. Strong theory. Still competitive in practice.$

$EartherAI's tweet photo. Day 57 of #100DaysOfML Today was about generative classifiers—specifically Naive Bayes. Key takeaways: Modeling P(x∣y)P(x \mid y)P(x∣y) instead of P(y∣x)P(y \mid x)P(y∣x) changes how you think about learning Class-conditional independence → simple likelihoods, strong baselines MLE for Bernoulli & Gaussian NB is surprisingly intuitive With equal covariance, Gaussian NB yields a linear decision boundary Zero-frequency issues → Laplace smoothing is not optional Simple assumptions. Strong theory. Still competitive in practice.$

$EartherAI's tweet photo. Day 57 of #100DaysOfML Today was about generative classifiers—specifically Naive Bayes. Key takeaways: Modeling P(x∣y)P(x \mid y)P(x∣y) instead of P(y∣x)P(y \mid x)P(y∣x) changes how you think about learning Class-conditional independence → simple likelihoods, strong baselines MLE for Bernoulli & Gaussian NB is surprisingly intuitive With equal covariance, Gaussian NB yields a linear decision boundary Zero-frequency issues → Laplace smoothing is not optional Simple assumptions. Strong theory. Still competitive in practice.$

$EartherAI's tweet photo. Day 57 of #100DaysOfML Today was about generative classifiers—specifically Naive Bayes. Key takeaways: Modeling P(x∣y)P(x \mid y)P(x∣y) instead of P(y∣x)P(y \mid x)P(y∣x) changes how you think about learning Class-conditional independence → simple likelihoods, strong baselines MLE for Bernoulli & Gaussian NB is surprisingly intuitive With equal covariance, Gaussian NB yields a linear decision boundary Zero-frequency issues → Laplace smoothing is not optional Simple assumptions. Strong theory. Still competitive in practice.$

0

4

0

0

552

1

7

0

0

547

Earther @EartherAI

9 days ago

0

2

0

0

236

Earther @EartherAI

12 days ago

@soydotrun @Amrita_Bh @liuhuan @SCAI_ASU Wow!

0

1

0

0

92

Earther @EartherAI

about 1 month ago

@chastronomic @WhatsApp Soo True

0

1

0

0

28

Earther @EartherAI

about 1 month ago

First round of 7-hour training is done. Next up: 6 hours of A100 experiments and analyzing the first phase results.

Earther @EartherAI

about 1 month ago

Training 2 models simultaneously on NVIDIA A100. Parallel experimentation, faster iteration, better research velocity.

EartherAI's tweet photo. Training 2 models simultaneously on NVIDIA A100. Parallel experimentation, faster iteration, better research velocity. https://t.co/oK6ZPzJaHX

0

3

0

0

211

1

5

0

0

111

Earther @EartherAI

about 1 month ago

@novasarc01 Wow! Congratulations 🎉🎉

0

2

0

0

109

Earther @EartherAI

about 1 month ago

Training 2 models simultaneously on NVIDIA A100. Parallel experimentation, faster iteration, better research velocity.

EartherAI's tweet photo. Training 2 models simultaneously on NVIDIA A100. Parallel experimentation, faster iteration, better research velocity. https://t.co/oK6ZPzJaHX

0

3

0

0

211

Earther @EartherAI

3 months ago

Day 65 of #100DaysofML Finished Artificial Neural networks; Multiclass classification -End of Machine Learning Theory

EartherAI's tweet photo. Day 65 of #100DaysofML

Finished Artificial Neural networks; Multiclass classification

-End of Machine Learning Theory https://t.co/t5Pivmd4YQ

Earther @EartherAI

3 months ago

Day 64 of #100DaysofML Complated theory of Ensemble methods - Bagging and Boosting (Adaboost)

EartherAI's tweet photo. Day 64 of #100DaysofML

Complated theory of Ensemble methods - Bagging and Boosting (Adaboost) https://t.co/eV4ZYLgFS9

1

4

0

0

279

0

3

0

1

113

Earther @EartherAI

3 months ago

Day 64 of #100DaysofML Complated theory of Ensemble methods - Bagging and Boosting (Adaboost)

EartherAI's tweet photo. Day 64 of #100DaysofML

Complated theory of Ensemble methods - Bagging and Boosting (Adaboost) https://t.co/eV4ZYLgFS9

Earther @EartherAI

4 months ago

Day 63 of #100DaysOfML - Ensembles & the Bias–Variance Game Generative vs Discriminative. Weak → Strong learners. Bagging → ↓ Variance (parallel, averaging). Boosting → ↓ Bias (sequential, adaptive). Overfit = high variance. Underfit = high bias. Smart aggregation > single model.

EartherAI's tweet photo. Day 63 of #100DaysOfML - Ensembles & the Bias–Variance Game

Generative vs Discriminative.
Weak → Strong learners.
Bagging → ↓ Variance (parallel, averaging).
Boosting → ↓ Bias (sequential, adaptive).

Overfit = high variance.
Underfit = high bias.
Smart aggregation > single model.

0

4

0

0

236

1

4

0

0

279

Earther @EartherAI

4 months ago

Day 63 of #100DaysOfML - Ensembles & the Bias–Variance Game Generative vs Discriminative. Weak → Strong learners. Bagging → ↓ Variance (parallel, averaging). Boosting → ↓ Bias (sequential, adaptive). Overfit = high variance. Underfit = high bias. Smart aggregation > single model.

EartherAI's tweet photo. Day 63 of #100DaysOfML - Ensembles & the Bias–Variance Game

Generative vs Discriminative.
Weak → Strong learners.
Bagging → ↓ Variance (parallel, averaging).
Boosting → ↓ Bias (sequential, adaptive).

Overfit = high variance.
Underfit = high bias.
Smart aggregation > single model.

Earther @EartherAI

4 months ago

Day 62 of #100DaysOfML Today cracking open the math behind Soft-Margin SVMs! The real magic lies in the Dual Problem and Complementary Slackness. 🟢 The Safe Zone: Correctly classified and safely past the margin. 🟡 The Support Vectors : Sitting perfectly on the margin edge. These define the boundary! 🔴 The Margin Violators: Points that are either inside the margin or completely misclassified. It's how complex optimization math translates into such clear geometric intuition!

EartherAI's tweet photo. Day 62 of #100DaysOfML

Today cracking open the math behind Soft-Margin SVMs!

The real magic lies in the Dual Problem and Complementary Slackness.

🟢 The Safe Zone:
Correctly classified and safely past the margin.

🟡 The Support Vectors :
Sitting perfectly on the margin edge. These define the boundary!

🔴 The Margin Violators:
Points that are either inside the margin or completely misclassified.

It's how complex optimization math translates into such clear geometric intuition!

0

2

0

0

146

0

4

0

0

236

Earther @EartherAI

4 months ago

Day 62 of #100DaysOfML Today cracking open the math behind Soft-Margin SVMs! The real magic lies in the Dual Problem and Complementary Slackness. 🟢 The Safe Zone: Correctly classified and safely past the margin. 🟡 The Support Vectors : Sitting perfectly on the margin edge. These define the boundary! 🔴 The Margin Violators: Points that are either inside the margin or completely misclassified. It's how complex optimization math translates into such clear geometric intuition!

EartherAI's tweet photo. Day 62 of #100DaysOfML

Today cracking open the math behind Soft-Margin SVMs!

The real magic lies in the Dual Problem and Complementary Slackness.

🟢 The Safe Zone:
Correctly classified and safely past the margin.

🟡 The Support Vectors :
Sitting perfectly on the margin edge. These define the boundary!

🔴 The Margin Violators:
Points that are either inside the margin or completely misclassified.

It's how complex optimization math translates into such clear geometric intuition!

Earther @EartherAI

5 months ago

Day 61 of #100DaysOfML – Logistic Regression Today was about the math behind Logistic Regression: • Sigmoid maps scores → probabilities • Likelihood → log-likelihood for numerical stability • Objective = maximize log-likelihood (or minimize cross-entropy) • Gradient: ∑ xᵢ (yᵢ − σ(wᵀxᵢ)) • Update rule: w ← w + η ∇logL • Regularization adds λ/2 ‖w‖² to prevent overfitting • Kernelization possible via w* = Σ αᵢ xᵢ From probability model → optimization → gradients → regularization.

EartherAI's tweet photo. Day 61 of #100DaysOfML – Logistic Regression

Today was about the math behind Logistic Regression:

• Sigmoid maps scores → probabilities
• Likelihood → log-likelihood for numerical stability
• Objective = maximize log-likelihood (or minimize cross-entropy)
• Gradient: ∑ xᵢ (yᵢ − σ(wᵀxᵢ))
• Update rule: w ← w + η ∇logL
• Regularization adds λ/2 ‖w‖² to prevent overfitting
• Kernelization possible via w* = Σ αᵢ xᵢ
From probability model → optimization → gradients → regularization.

1

6

0

1

219

0

2

0

0

146

Earther @EartherAI

4 months ago

Reminds of this ... by @ilyasut

EartherAI's tweet photo. Reminds of this ... by @ilyasut https://t.co/jOvBIagsXv

Derya Unutmaz, MD

4 months ago

The next several years will see the greatest destruction of human ego in history! In the age of AI, there will be three big losers: 1) Those who maintain high ego & arrogance about their intellectual abilities; 2) Those whose careers depends on gatekeeping; 3) AI deniers.

194

1K

181

270

89K

0

1

0

0

74

Earther @EartherAI

5 months ago

Hard take: If India builds its thorium reactor within the next 2-5 years, it can become one of the world's superpowers. Otherwise, the train has already left. This is the only visible move - move 37 - left for India.

0

4

0

0

116

Earther @EartherAI

5 months ago

@theJayAlto Metaphor ; Biology Meets Modern Life.

0

1

0

0

39

Earther @EartherAI

5 months ago

@learner_03038 Thank you!

0

1

0

0

8

Earther @EartherAI

5 months ago

Day 61 of #100DaysOfML – Logistic Regression Today was about the math behind Logistic Regression: • Sigmoid maps scores → probabilities • Likelihood → log-likelihood for numerical stability • Objective = maximize log-likelihood (or minimize cross-entropy) • Gradient: ∑ xᵢ (yᵢ − σ(wᵀxᵢ)) • Update rule: w ← w + η ∇logL • Regularization adds λ/2 ‖w‖² to prevent overfitting • Kernelization possible via w* = Σ αᵢ xᵢ From probability model → optimization → gradients → regularization.

EartherAI's tweet photo. Day 61 of #100DaysOfML – Logistic Regression

Today was about the math behind Logistic Regression:

• Sigmoid maps scores → probabilities
• Likelihood → log-likelihood for numerical stability
• Objective = maximize log-likelihood (or minimize cross-entropy)
• Gradient: ∑ xᵢ (yᵢ − σ(wᵀxᵢ))
• Update rule: w ← w + η ∇logL
• Regularization adds λ/2 ‖w‖² to prevent overfitting
• Kernelization possible via w* = Σ αᵢ xᵢ
From probability model → optimization → gradients → regularization.

Earther @EartherAI

6 months ago

Day 60/100 #100DaysOfML Diving into Support Vector Machines intuition! From perceptrons to max margins for better classifiers. Here's a structured breakdown: Perceptron Foundation: Mistake bound is # mistakes ≤ R²/γ², where γ is the margin. Larger margins = fewer errors & better generalization. Goal: Formulate optimization to directly maximize the margin, avoiding small-margin pitfalls. Key Derivations:Normalize weights: Set ||w||=1, then maximize γ s.t. y_i (w·x_i) ≥ γ for all i (avoids scaling issues). Equivalent form: Fix functional margin to 1, minimize (1/2)||w||² s.t. y_i (w·x_i) ≥ 1. Geometric margin: Simplifies to 2/||w||.

EartherAI's tweet photo. Day 60/100 #100DaysOfML
Diving into Support Vector Machines intuition!

From perceptrons to max margins for better classifiers. Here's a structured breakdown:

Perceptron Foundation: Mistake bound is # mistakes ≤ R²/γ², where γ is the margin. Larger margins = fewer errors & better generalization.

Goal: Formulate optimization to directly maximize the margin, avoiding small-margin pitfalls.

Key Derivations:Normalize weights: Set ||w||=1, then maximize γ s.t. y_i (w·x_i) ≥ γ for all i (avoids scaling issues).

Equivalent form: Fix functional margin to 1, minimize (1/2)||w||² s.t. y_i (w·x_i) ≥ 1.
Geometric margin: Simplifies to 2/||w||.

2

6

0

0

228

1

6

0

1

219

Earther @EartherAI

5 months ago

@thepanshu_logs Hey I have been working on a paper too can you suggest the platform for digram making.

1

3

0

0

131

Earther @EartherAI

6 months ago

@saurabhtwq I have to try this out!!

0

1

0

0

27

Earther @EartherAI

6 months ago

From tomorrow back to maths bulid UP!!

0

4

0

0

49

Earther @EartherAI

6 months ago

Day 60/100 #100DaysOfML Diving into Support Vector Machines intuition! From perceptrons to max margins for better classifiers. Here's a structured breakdown: Perceptron Foundation: Mistake bound is # mistakes ≤ R²/γ², where γ is the margin. Larger margins = fewer errors & better generalization. Goal: Formulate optimization to directly maximize the margin, avoiding small-margin pitfalls. Key Derivations:Normalize weights: Set ||w||=1, then maximize γ s.t. y_i (w·x_i) ≥ γ for all i (avoids scaling issues). Equivalent form: Fix functional margin to 1, minimize (1/2)||w||² s.t. y_i (w·x_i) ≥ 1. Geometric margin: Simplifies to 2/||w||.

EartherAI's tweet photo. Day 60/100 #100DaysOfML
Diving into Support Vector Machines intuition!

From perceptrons to max margins for better classifiers. Here's a structured breakdown:

Perceptron Foundation: Mistake bound is # mistakes ≤ R²/γ², where γ is the margin. Larger margins = fewer errors & better generalization.

Goal: Formulate optimization to directly maximize the margin, avoiding small-margin pitfalls.

Key Derivations:Normalize weights: Set ||w||=1, then maximize γ s.t. y_i (w·x_i) ≥ γ for all i (avoids scaling issues).

Equivalent form: Fix functional margin to 1, minimize (1/2)||w||² s.t. y_i (w·x_i) ≥ 1.
Geometric margin: Simplifies to 2/||w||.

Earther @EartherAI

6 months ago

EartherAI's tweet photo. https://t.co/9W98kpkgHk

1

7

0

0

547

2

6

0

0

228

Last Seen Users on Sotwe

Trends for you

Most Popular Users