Chris

@Topenomics

Graduate student

Bangkok, Thailand

Joined August 2018

327 Following

50 Followers

844 Posts

Topenomics retweeted

Carson Mock @CoachCMock

about 1 month ago

Be the most competitive!! Competitiveness isn't just a personality trait-it's a decision. It's not about talent or size but about desire. You have the ability to choose your level of competitiveness. @BallisPsych

CoachCMock's tweet photo. Be the most competitive!!

Competitiveness isn't just a personality trait-it's a decision. It's not about talent or size but about desire. You have the ability to choose your level of competitiveness.

@BallisPsych https://t.co/iMhzW2dRlN

436

105

147

18K

Chris @Topenomics

about 1 month ago

Topenomics retweeted

PYOFLIFE.COM @Parajulisaroj16

about 2 months ago

This comprehensive guide sheds light on how integrating Python and R can lead to exceptional outcomes, unlocking a world of possibilities for contemporary data professionals. https://t.co/FNI4YN8KFr #datascience #RStats #datascientists #Pythonprogramming #MachineLearning

Parajulisaroj16's tweet photo. This comprehensive guide sheds light on how integrating Python and R can lead to exceptional outcomes, unlocking a world of possibilities for contemporary data professionals. https://t.co/FNI4YN8KFr
#datascience #RStats #datascientists #Pythonprogramming #MachineLearning https://t.co/6E1vITX1zT

399

268

14K

Topenomics retweeted

Dr. Banda Khalifa MD, MPH, MBA

@dr_bandak

2 months ago

Choosing a statistical test is one of the challenges students and early researchers face. Let me summarize this paper in 5 minutes. The right statistical test usually depends on these core questions: → What is your outcome variable? → Are your groups paired or unpaired? → Is your data continuous, categorical, or binary? → If continuous, is it normally distributed? Once you answer those, the fog starts to clear. ⸻ 1️⃣ Start with the type of variable Ask yourself: → Is the outcome continuous? e.g., blood pressure, weight, test score → Is it categorical/binary? e.g., yes/no, success/failure, disease/no disease That single distinction already removes much of the confusion. ⸻ 2️⃣ Then ask: paired or unpaired? This matters more than many students realize. → Unpaired = two different groups → Paired = the same subjects measured twice, or matched observations If you miss this step, you may choose the wrong test even when the variable type is correct. ⸻ 3️⃣ If the outcome is categorical Think along these lines: → Chi-square test → Fisher’s exact test → McNemar test for paired categorical data This is why chi-square does not belong everywhere. The data structure decides. ⸻ 4️⃣ If the outcome is continuous Now the next question is: → Normally distributed? or → Not normally distributed? That determines whether you use: Parametric tests like: → t-test → ANOVA Or non-parametric tests like: → Mann–Whitney U → Wilcoxon signed-rank → Kruskal–Wallis → Friedman test ⸻ 5️⃣ Do not choose the test after seeing the data This is one of the most important reminders in the paper. The statistical test, the null hypothesis, and the significance level should be specified before the study is carried out. That protects the integrity of the analysis. ⸻ 6️⃣ Most papers do not require you to know every test You can interpret a large proportion of medical research papers if you understand: → t-test → Chi-square test → Fisher’s exact test You do not need to master everything at once. You need to master the logic. ⸻ 7️⃣ The real skill is learning how to think through the decision: → What is the question? → What is the endpoint? → What is the data type? → What is the study design? Once that becomes clear, the test often becomes obvious. ⸻ 💬 Which statistical test confused you the most when you were starting? ———- Reference: du Prel JB, Röhrig B, Hommel G, Blettner M. Choosing Statistical Tests. Deutsches Ärzteblatt International. 2010;107(19):343–348.

dr_bandak's tweet photo. Choosing a statistical test is one of the challenges students and early researchers face. Let me summarize this paper in 5 minutes.

The right statistical test usually depends on these core questions:

→ What is your outcome variable?
→ Are your groups paired or unpaired?
→ Is your data continuous, categorical, or binary?

→ If continuous, is it normally distributed?

Once you answer those, the fog starts to clear.

⸻

1️⃣ Start with the type of variable

Ask yourself:

→ Is the outcome continuous?
e.g., blood pressure, weight, test score

→ Is it categorical/binary?
e.g., yes/no, success/failure, disease/no disease

That single distinction already removes much of the confusion.

⸻

2️⃣ Then ask: paired or unpaired?

This matters more than many students realize.

→ Unpaired = two different groups
→ Paired = the same subjects measured twice, or matched observations

If you miss this step, you may choose the wrong test even when the variable type is correct.

⸻

3️⃣ If the outcome is categorical

Think along these lines:

→ Chi-square test
→ Fisher’s exact test
→ McNemar test for paired categorical data

This is why chi-square does not belong everywhere.

The data structure decides.

⸻

4️⃣ If the outcome is continuous

Now the next question is:

→ Normally distributed?
or
→ Not normally distributed?

That determines whether you use:

Parametric tests like:
→ t-test
→ ANOVA

Or non-parametric tests like:
→ Mann–Whitney U
→ Wilcoxon signed-rank
→ Kruskal–Wallis
→ Friedman test

⸻

5️⃣ Do not choose the test after seeing the data

This is one of the most important reminders in the paper.

The statistical test, the null hypothesis, and the significance level should be specified before the study is carried out.

That protects the integrity of the analysis.

⸻

6️⃣ Most papers do not require you to know every test

You can interpret a large proportion of medical research papers if you understand:

→ t-test
→ Chi-square test
→ Fisher’s exact test

You do not need to master everything at once.

You need to master the logic.

⸻

7️⃣ The real skill is learning how to think through the decision:

→ What is the question?
→ What is the endpoint?
→ What is the data type?
→ What is the study design?

Once that becomes clear, the test often becomes obvious.

⸻

💬 Which statistical test confused you the most when you were starting?
———-

Reference:
du Prel JB, Röhrig B, Hommel G, Blettner M. Choosing Statistical Tests. Deutsches Ärzteblatt International. 2010;107(19):343–348.

189

141

Who to follow

Priyanka Mehta

@PriyankaMMehta

Bioinformatician/ PhD student at @IGIBsocial. Exploring Alternate Splicing during Host-Pathogen Interaction. #transcriptomics #bioinformatics #genomics

Antonio Alegría

@Elmedicobrujo

Data analyst | Behavioral Scientist | No tan amargado 🚲 🌱🧉

Aníbal A. Teherán

@md_teheran

Médico Epidemiólogo, Economista de la Salud, Magíster en Bioestadística y Salud Pública. Docente Investigador - FUJNC

Topenomics retweeted

Kirk Borne

@KirkDBorne

2 months ago

A First Course in Causal Inference: https://t.co/Ew9PcMQ8Ff [490-page PDF download] + Also see the book "Causal Inference in Statistics: A Primer" at https://t.co/ROEZZZAFqN by @yudapearl #Probability #Mathematics #DataScience #ML #MachineLearning #DataScientist #DataAnalysis

KirkDBorne's tweet photo. A First Course in Causal Inference: https://t.co/Ew9PcMQ8Ff [490-page PDF download]

+ Also see the book "Causal Inference in Statistics: A Primer" at https://t.co/ROEZZZAFqN by @yudapearl

#Probability #Mathematics #DataScience #ML #MachineLearning #DataScientist #DataAnalysis https://t.co/N48JB8W3I6

225

134

11K

Topenomics retweeted

Nick Bearman @NickBearmanUK

2 months ago

Just testing out my material in https://t.co/VuDqzXwXWv for my Introduction to Using R as a GIS course coming up on 28 & 29 April https://t.co/RYP1cGz462. https://t.co/VuDqzXwXWv is a great way of running R if you can't install it yourself! More details at https://t.co/Q54DKOhg1W

NickBearmanUK's tweet photo. Just testing out my material in https://t.co/VuDqzXwXWv for my Introduction to Using R as a GIS course coming up on 28 & 29 April https://t.co/RYP1cGz462. https://t.co/VuDqzXwXWv is a great way of running R if you can't install it yourself! More details at https://t.co/Q54DKOhg1W https://t.co/OtbgOP5NMx

333

189

13K

Topenomics retweeted

Curious Minds

@CuriousMindsHub

2 months ago

How To Understand Anything Become An Elite Thinker: Frameworks, Mental Models, and Tools for Understanding (Almost) Anything

CuriousMindsHub's tweet photo. How To Understand Anything

Become An Elite Thinker: Frameworks, Mental Models, and Tools for Understanding (Almost) Anything https://t.co/H9rqlzr6It

966

191

893

37K

Topenomics retweeted

Qiusheng Wu

@giswqs

2 months ago

Just received a proof copy of my new GeoAI book from Amazon! The printing quality is very good. It is now the #1 best-seller in GIS, Remote Sensing, and Computer Programming Languages. Grab your packpaer copy here: https://t.co/03KbrDXord PDF edition available in English, Spanish, Chinese, French, and German: https://t.co/i039IxTV6U #geospatial #geoai #opensource

638

500

36K

Topenomics retweeted

Probability and Statistics

@probnstat

2 months ago

Categorical data represents variables that take values from a finite set of discrete categories, such as labels or classes. Mathematically, a categorical variable X ∈ {1,…,K} is modeled using a probability vector p = (p₁,…,p_K), where ∑ p_k = 1 and P(X=k)=p_k. A common representation is one-hot encoding, where each category is mapped to a binary vector. In statistics, categorical data is analyzed using models like the multinomial distribution and tools such as contingency tables and chi-square tests. In machine learning, it is used in classification tasks, with models like logistic regression and softmax outputs: P(Y=k|x)=exp(z_k)/∑ⱼ exp(z_j). In real life, categorical data appears in surveys, medical diagnosis, recommendation systems, and customer segmentation, where understanding discrete group behavior enables better decisions and predictions.

$probnstat's tweet photo. Categorical data represents variables that take values from a finite set of discrete categories, such as labels or classes. Mathematically, a categorical variable X ∈ {1,…,K} is modeled using a probability vector p = (p₁,…,p_K), where ∑ p_k = 1 and P(X=k)=p_k. A common representation is one-hot encoding, where each category is mapped to a binary vector. In statistics, categorical data is analyzed using models like the multinomial distribution and tools such as contingency tables and chi-square tests. In machine learning, it is used in classification tasks, with models like logistic regression and softmax outputs: P(Y=k|x)=exp(z_k)/∑ⱼ exp(z_j). In real life, categorical data appears in surveys, medical diagnosis, recommendation systems, and customer segmentation, where understanding discrete group behavior enables better decisions and predictions.$

285

157

10K

Topenomics retweeted

Jaynit

@jaynitx

2 months ago

In 2019, MIT professor Patrick Winston gave a legendary 1-hour lecture called “How to Speak.” It has 18M+ views for a reason. His frameworks: • Your ideas are like your children • The 5-minute rule for job talks • Why jokes fail at the start 15 lessons on communication:

229

40K

89K

Topenomics retweeted

Curious Minds

@CuriousMindsHub

2 months ago

Writing is thinking

883

175

298

20K

Topenomics retweeted

𝗿𝗮𝗺𝗮𝗸𝗿𝘂𝘀𝗵𝗻𝗮— 𝗲/𝗮𝗰𝗰

@techwith_ram

2 months ago

A First Course in Casual Inference by Peng Ding PDF: https://t.co/FjmC3nBQ9G So many guys nowadays are eager to learn state-of-the-art theory and methods in causal inference so that they are better equipped to solve problems from various fields. This is a good book. It covers: - Correlation, Association, and the Yule–Simpson Paradox - Potential Outcomes and the Experimentalist's View - Treatment Assignment Mechanisms - Completely Randomized Experiments (CRE) - Fisher Randomization Test (FRT) - Canonical Choices of Test Statistics - Basic Probability Theory and Statistical Inference (Prerequisites) - Linear and Logistic Regressions - Neyman's Potential Outcomes Notation

techwith_ram's tweet photo. A First Course in Casual Inference by Peng Ding

PDF: https://t.co/FjmC3nBQ9G

So many guys nowadays are eager to learn state-of-the-art theory and methods in causal inference so that they are better equipped to solve problems from various fields.

This is a good book. It covers:

- Correlation, Association, and the Yule–Simpson Paradox
- Potential Outcomes and the Experimentalist's View
- Treatment Assignment Mechanisms
- Completely Randomized Experiments (CRE)
- Fisher Randomization Test (FRT)
- Canonical Choices of Test Statistics
- Basic Probability Theory and Statistical Inference (Prerequisites)
- Linear and Logistic Regressions
- Neyman's Potential Outcomes Notation

839

162

893

38K

Topenomics retweeted

Joachim Schork

@JoachimSchork

2 months ago

If you're still using raw R outputs for presentations, it's time for an upgrade! Tools like gtsummary bring your statistical results to life, making them much more digestible for non-technical audiences. While base R functions like summary(fit) work well for statisticians, they can be too complex for stakeholders who aren’t familiar with the detailed output. The tbl_regression() function from gtsummary makes it easy to present regression results clearly. In addition, gtsummary is highly versatile - it’s not just limited to linear regression. You can apply it to generalized linear models, survival analyses, and more. The package even allows you to include p-values, confidence intervals, and other important statistics directly within the tables, helping you to better communicate statistical results. Here are a few standout benefits: ✅ Simplified output that’s easier for stakeholders to understand ✅ Works seamlessly with a variety of models ✅ Customizable tables with key statistics like p-values, confidence intervals, and more The visualization included here was originally shared in a post by Dr. Alexander Krannich. Thanks to Alexander for inspiring me to create this post. Interested in more tips on data science, statistics, Python, and R? Be sure to sign up for my free email newsletter! Click this link for detailed information: https://t.co/ktUcWo9XpO #programming #datasciencetraining #DataAnalytics #RStats #R4DS #Rpackage

264

133

10K

Topenomics retweeted

PYOFLIFE.COM @Parajulisaroj16

3 months ago

In this comprehensive guide, we’ll walk you through performing sentiment analysis in R, a powerful programming language for statistical computing and data analysis. https://t.co/SFLubqSLZC #DataScience #RStats #datascientists #machinelearning #datavisualizations #statistics

Parajulisaroj16's tweet photo. In this comprehensive guide, we’ll walk you through performing sentiment analysis in R, a powerful programming language for statistical computing and data analysis. https://t.co/SFLubqSLZC
#DataScience #RStats #datascientists #machinelearning #datavisualizations #statistics https://t.co/7OphpYtiTO

Topenomics retweeted

Curious Minds

@CuriousMindsHub

3 months ago

How To Remember Everything you Read

232K

Topenomics retweeted

𝗿𝗮𝗺𝗮𝗸𝗿𝘂𝘀𝗵𝗻𝗮— 𝗲/𝗮𝗰𝗰

@techwith_ram

3 months ago

Pen & Paper Exercises in Machine Learning by Michael U. Gutmann Book PDF here: https://t.co/FUo4Nb3SIo This book is a collection of (mostly) pen-and-paper exercises in machine learning. Each exercise comes with a detailed solution. The following topics are covered: - Linear algebra - Optimisation - Directed graphical models - Undirected graphical models - Expressive power of graphical models - Factor graphs and message passing - Inference for hidden Markov models - Model-based learning (including ICA and unnormalized models) - Sampling and Monte-Carlo integration - Variational inference Repo: https://t.co/OZ219CXvmQ

techwith_ram's tweet photo. Pen & Paper Exercises in Machine Learning by Michael U. Gutmann

Book PDF here: https://t.co/FUo4Nb3SIo

This book is a collection of (mostly) pen-and-paper exercises in machine learning. Each exercise comes with a detailed solution.

The following topics are covered:
- Linear algebra
- Optimisation
- Directed graphical models
- Undirected graphical models
- Expressive power of graphical models
- Factor graphs and message passing
- Inference for hidden Markov models
- Model-based learning (including ICA and unnormalized models)
- Sampling and Monte-Carlo integration
- Variational inference

Repo: https://t.co/OZ219CXvmQ

733

126

36K

Topenomics retweeted

𝗿𝗮𝗺𝗮𝗸𝗿𝘂𝘀𝗵𝗻𝗮— 𝗲/𝗮𝗰𝗰

@techwith_ram

3 months ago

Introduction to Computational Thinking and Data Science by MIT You can access Lecture PDFs: https://t.co/mG5MLOmye3 Lecture Videos: https://t.co/lM3W1UTYV1

techwith_ram's tweet photo. Introduction to Computational Thinking and Data Science by MIT

You can access Lecture PDFs: https://t.co/mG5MLOmye3

Lecture Videos: https://t.co/lM3W1UTYV1 https://t.co/rpj75Zz5In

803

154

33K

Topenomics retweeted

Mashford Mahute @MashfordMahute

4 months ago

12 FREE TUTORIALS ON SPATIAL DATA MANAGEMENT WITH POSTGIS 🪡🧵

278

291

10K

Topenomics retweeted

Michael Pyrcz🌻

@GeostatsGuy

4 months ago

I’m STOKED to share a new chapter on Monte Carlo Simulation (MCS) in my free, online e-book, Applied Geostatistics in Python! 🤘📘 To better support my students (and the broader community!), I added hands-on, well-documented Python demos with reproducible code to help anyone get started with #DataScience, uncertainty quantification, and simulation. 🧠📊💥 I love building resources that make learning practical, accessible, and empowering — and I’m STOKED to help! 🙌🔥 Check it out here: 👉 https://t.co/9iBVbYnBcX

GeostatsGuy's tweet photo. I’m STOKED to share a new chapter on Monte Carlo Simulation (MCS) in my free, online e-book, Applied Geostatistics in Python! 🤘📘

To better support my students (and the broader community!), I added hands-on, well-documented Python demos with reproducible code to help anyone get started with #DataScience, uncertainty quantification, and simulation. 🧠📊💥

I love building resources that make learning practical, accessible, and empowering — and I’m STOKED to help! 🙌🔥

Check it out here:
👉 https://t.co/9iBVbYnBcX

310

228

11K

Topenomics retweeted

Michael Pyrcz🌻

@GeostatsGuy

4 months ago

In my new free, online #DataScience e-books, I focus on creating effective and impactful visualizations. The first step I take is always to ask: What message am I trying to communicate with this plot? From there, I design the plot specifically to make that message clear and intuitive. Rarely are my visuals the default, generic plots—each one is crafted to enhance understanding and engagement. See my ideas for model and #dataviz with, Applied #MachineLearning in #Python: https://t.co/pq4eWwjPQQ Applied #Geostatistics in Python: https://t.co/KFHZgbnbW0

GeostatsGuy's tweet photo. In my new free, online #DataScience e-books, I focus on creating effective and impactful visualizations.

The first step I take is always to ask: What message am I trying to communicate with this plot? From there, I design the plot specifically to make that message clear and intuitive. Rarely are my visuals the default, generic plots—each one is crafted to enhance understanding and engagement. See my ideas for model and #dataviz with,

Applied #MachineLearning in #Python: https://t.co/pq4eWwjPQQ

Applied #Geostatistics in Python: https://t.co/KFHZgbnbW0

389

273

20K

Chris

@Topenomics

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users