code

The document outlines a Python script that processes an Excel file to compute mean values for different groups and simplifies taxon names. It prepares the data for plotting and identifies the top 20 taxa based on mean abundance, followed by creating a bar plot without error bars. Finally, the generated plot is saved in both PNG and TIFF formats.

Uploaded by

Sathiyaraj Srinivasan

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

code

Uploaded by

Sathiyaraj Srinivasan

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 2

import pandas as pd

import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

# Reload the Excel file

file_path = "/mnt/data/processed_sample_analysis.xlsx"
mean_df = pd.read_excel(file_path)

# Recompute group means

mean_df["G1"] = mean_df[["G1_6w_mean", "G1_10w_mean"]].mean(axis=1)
mean_df["G2"] = mean_df[["G2_6w_mean", "G2_10w_mean"]].mean(axis=1)
mean_df["G3"] = mean_df[["G3_6w_mean", "G3_10w_mean"]].mean(axis=1)

# Simplify taxon name

def simplify_taxonomy(clade):
levels = clade.split('|')
known = [l for l in levels if '__' in l and not any(x in l for x in ['GGB',
'SGB', 'CFGB', 'OFGB'])]
return known[-1] if known else clade

mean_df["Taxon"] = mean_df["clade_name"].apply(simplify_taxonomy)

# Prepare data for plotting

plot_ready_df = mean_df[["Taxon", "G1", "G2", "G3"]].melt(id_vars="Taxon",
var_name="Group", value_name="Abundance")
plot_ready_df["log10_abundance"] = np.log10(plot_ready_df["Abundance"] + 1e-6)
plot_ready_df["Group_Label"] = plot_ready_df["Group"].map({"G1": "ND", "G2": "HFD",
"G3": "HFD+EF-2001"})

# Get top 20 taxa

plot_ready_df["Mean_Abundance"] = plot_ready_df.groupby("Taxon")
["Abundance"].transform("mean")
top_20_df = plot_ready_df.sort_values("Mean_Abundance",
ascending=False).drop_duplicates("Taxon").head(20)
top_taxa_names = top_20_df["Taxon"].tolist()
top_plot_df = plot_ready_df[plot_ready_df["Taxon"].isin(top_taxa_names)]

# Organize plotting order

top_plot_df["Group_Order"] = top_plot_df["Group"].map({"G1": 0, "G2": 1, "G3": 2})
top_plot_df["Taxon_Group"] = top_plot_df["Taxon"] + " (" +
top_plot_df["Group_Label"] + ")"
top_plot_df["Sort_Index"] = top_plot_df["Group_Order"] * 1000 + top_plot_df.index

# Plot without error bars

fig, ax = plt.subplots(figsize=(10, 12))
sns.barplot(
data=top_plot_df.sort_values("Sort_Index"),
x="log10_abundance",
y="Taxon_Group",
hue="Group_Label",
dodge=False,
ci=None, # No error bars
palette={"ND": "green", "HFD": "blue", "HFD+EF-2001": "red"},
ax=ax
)

ax.set_xlabel("LDA SCORE (log10 abundance)", fontsize=12)

ax.set_ylabel("Taxonomic Group by Sample Group", fontsize=12)
ax.set_title("Top 20 Discriminative Taxa", fontsize=14)
ax.legend(title="Group")
plt.tight_layout()

# Save images
png_path_no_error = "/mnt/data/top20_taxa_grouped_plot_no_error.png"
tiff_path_no_error = "/mnt/data/top20_taxa_grouped_plot_no_error.tiff"
fig.savefig(png_path_no_error, dpi=600)
fig.savefig(tiff_path_no_error, dpi=600)

png_path_no_error, tiff_path_no_error

How The Brain Works The Facts Visually Explained-101-150
No ratings yet
How The Brain Works The Facts Visually Explained-101-150
50 pages
NLP Lab
No ratings yet
NLP Lab
18 pages
Mayank Chaudhary DEV Practicals
No ratings yet
Mayank Chaudhary DEV Practicals
14 pages
Da Programs
No ratings yet
Da Programs
10 pages
Https Raw - Githubusercontent.com Joelgrus Data-Science-From-Scratch Master Code Natural Language Processing
No ratings yet
Https Raw - Githubusercontent.com Joelgrus Data-Science-From-Scratch Master Code Natural Language Processing
5 pages
Machine Learning Lab Record: Dr. Sarika Hegde
No ratings yet
Machine Learning Lab Record: Dr. Sarika Hegde
23 pages
DA2_using-matplotlib
No ratings yet
DA2_using-matplotlib
4 pages
python pandas
No ratings yet
python pandas
13 pages
Machine Learning Laboratory (21AIL66)
No ratings yet
Machine Learning Laboratory (21AIL66)
7 pages
program - 3
No ratings yet
program - 3
4 pages
Vertopal.com 01 MichaelHarris WinningPatterns
No ratings yet
Vertopal.com 01 MichaelHarris WinningPatterns
16 pages
Rezolvate Info Colocviu 1
No ratings yet
Rezolvate Info Colocviu 1
9 pages
Machine Learning Lab (17CSL76)
No ratings yet
Machine Learning Lab (17CSL76)
48 pages
Stat Lab
No ratings yet
Stat Lab
24 pages
ML Lab Manual
No ratings yet
ML Lab Manual
28 pages
ML Lab Codes
No ratings yet
ML Lab Codes
14 pages
Ass
No ratings yet
Ass
5 pages
LAB 1 (1)_merged
No ratings yet
LAB 1 (1)_merged
6 pages
AIML
No ratings yet
AIML
12 pages
Coding Notes Data Science
No ratings yet
Coding Notes Data Science
4 pages
BDS306B_Module5
No ratings yet
BDS306B_Module5
5 pages
KE Lab Codes
No ratings yet
KE Lab Codes
9 pages
ML LAB P-1
No ratings yet
ML LAB P-1
10 pages
PythonForMachineLearning
No ratings yet
PythonForMachineLearning
66 pages
MLRecord
No ratings yet
MLRecord
24 pages
Oxy Metre
No ratings yet
Oxy Metre
17 pages
Practical No 05
No ratings yet
Practical No 05
4 pages
Prac 5
No ratings yet
Prac 5
3 pages
Code
No ratings yet
Code
2 pages
MLSolutions
No ratings yet
MLSolutions
4 pages
ML Lab
No ratings yet
ML Lab
7 pages
Untitled document-2-1-13-7-11.4
No ratings yet
Untitled document-2-1-13-7-11.4
5 pages
External
No ratings yet
External
11 pages
DWDM Lab Manual
No ratings yet
DWDM Lab Manual
32 pages
JPMC - Task 4
No ratings yet
JPMC - Task 4
3 pages
DOC-20250211-WA0009. (1)
No ratings yet
DOC-20250211-WA0009. (1)
26 pages
Notes Dv
No ratings yet
Notes Dv
19 pages
C121 Exp1
No ratings yet
C121 Exp1
32 pages
IR - 754 All Practical
No ratings yet
IR - 754 All Practical
21 pages
EDS - Python Cheat Sheet
No ratings yet
EDS - Python Cheat Sheet
3 pages
Implementing Metaheuristic Algoritham: (Genetic Algorithm) Objective Lab Tasks: Code
No ratings yet
Implementing Metaheuristic Algoritham: (Genetic Algorithm) Objective Lab Tasks: Code
9 pages
Lecture 22
No ratings yet
Lecture 22
64 pages
Principal Component Analysis Notes : Info
No ratings yet
Principal Component Analysis Notes : Info
22 pages
ML Lab Record
No ratings yet
ML Lab Record
33 pages
Aiml Lab Manual 2023
No ratings yet
Aiml Lab Manual 2023
17 pages
RRRRRRRRRRRRRRR
No ratings yet
RRRRRRRRRRRRRRR
4 pages
Experiment 1 solution
No ratings yet
Experiment 1 solution
5 pages
Aiml Lab
No ratings yet
Aiml Lab
14 pages
ML Lab Manual PDF
No ratings yet
ML Lab Manual PDF
9 pages
AML_code_for_m2
No ratings yet
AML_code_for_m2
7 pages
Artificial Intelligence (18Csc305J) Lab: EXPERIMENT 13: Implementation of NLP Problem
No ratings yet
Artificial Intelligence (18Csc305J) Lab: EXPERIMENT 13: Implementation of NLP Problem
9 pages
Code shabab error 7
No ratings yet
Code shabab error 7
5 pages
EDA Python Guide
No ratings yet
EDA Python Guide
11 pages
R语言基础入门指令 (tips)
No ratings yet
R语言基础入门指令 (tips)
14 pages
Loading Pandas
No ratings yet
Loading Pandas
23 pages
ML Lab Programs
No ratings yet
ML Lab Programs
18 pages
Unit1 ML Programs
No ratings yet
Unit1 ML Programs
5 pages
ml_labmanual (3)
No ratings yet
ml_labmanual (3)
33 pages
Edx Course Lab Programs
No ratings yet
Edx Course Lab Programs
19 pages
AI_lab(manual)
No ratings yet
AI_lab(manual)
11 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Succulents Catalog 2022-2023 Online
No ratings yet
Succulents Catalog 2022-2023 Online
44 pages
HSC Part-I Grade - Xi Condensed Course of Academic Year 2020-21
No ratings yet
HSC Part-I Grade - Xi Condensed Course of Academic Year 2020-21
27 pages
Bio Zone
No ratings yet
Bio Zone
8 pages
10-The Nano World and Gene Therapy
No ratings yet
10-The Nano World and Gene Therapy
9 pages
Bio Project
No ratings yet
Bio Project
15 pages
Bacillus Anthracis Powerpoint
100% (1)
Bacillus Anthracis Powerpoint
32 pages
Research Article: Tuti Sri Hastuti, Rachmat Sumantri, Indra Wijaya
No ratings yet
Research Article: Tuti Sri Hastuti, Rachmat Sumantri, Indra Wijaya
8 pages
Vuzv Org Chart 2018 en
No ratings yet
Vuzv Org Chart 2018 en
1 page
9 TiengAnh HSG12 2024 DE
No ratings yet
9 TiengAnh HSG12 2024 DE
6 pages
2 Dan 3. Metabolisme Xenobiotik
No ratings yet
2 Dan 3. Metabolisme Xenobiotik
137 pages
Kỳ Anh Lần 1- Năm Học 2023- 2024
No ratings yet
Kỳ Anh Lần 1- Năm Học 2023- 2024
5 pages
10,11 Lecture Presentation2024
No ratings yet
10,11 Lecture Presentation2024
110 pages
Single-Stranded RNA Phages-From Molecular Biology To Nanotechnology 1st Edition Paul Pumpens (Author) All Chapter Instant Download
100% (3)
Single-Stranded RNA Phages-From Molecular Biology To Nanotechnology 1st Edition Paul Pumpens (Author) All Chapter Instant Download
62 pages
Form Evolution in Nature and in Architecture
No ratings yet
Form Evolution in Nature and in Architecture
81 pages
Fce
No ratings yet
Fce
5 pages
Patterns in Nature and The Mathematics Behind It: Peng Feng
No ratings yet
Patterns in Nature and The Mathematics Behind It: Peng Feng
58 pages
Form 5 Biology Peka
No ratings yet
Form 5 Biology Peka
4 pages
Advancing Antibodies Through The Pipeline Delivering Effective Therapies
No ratings yet
Advancing Antibodies Through The Pipeline Delivering Effective Therapies
4 pages
General Chapters - 1222 - Terminally Sterilized Pharmaceutical Products-Parametric Release
0% (1)
General Chapters - 1222 - Terminally Sterilized Pharmaceutical Products-Parametric Release
5 pages
Each Muscle Is Served by One Artery, One Nerve, and One or More Veins
No ratings yet
Each Muscle Is Served by One Artery, One Nerve, and One or More Veins
58 pages
Methods of Self Incompatibility
No ratings yet
Methods of Self Incompatibility
4 pages
Name: Shany C. Bantayanon Grade & Sec. 12D-STEM Date: 08/31/21 Learning Activity No. 2 Organelle Nicknames
No ratings yet
Name: Shany C. Bantayanon Grade & Sec. 12D-STEM Date: 08/31/21 Learning Activity No. 2 Organelle Nicknames
3 pages
Chattanooga Intelect Advanced Brochure Manual
No ratings yet
Chattanooga Intelect Advanced Brochure Manual
32 pages
P.E 12 2ndsem 1st
No ratings yet
P.E 12 2ndsem 1st
31 pages
Differential Expression Analysis With Deseq2: Dr. Kathi Zarnack
No ratings yet
Differential Expression Analysis With Deseq2: Dr. Kathi Zarnack
8 pages
Mendelian Randomization Methods for Causal Inference Using Genetic Variants 2nd Edition Stephen Burgess 2024 scribd download
100% (1)
Mendelian Randomization Methods for Causal Inference Using Genetic Variants 2nd Edition Stephen Burgess 2024 scribd download
67 pages
DND2024 Monster Manual
No ratings yet
DND2024 Monster Manual
6 pages
Southeast Asian Cooking Learn Easy Southeast Asian Cooking With Delicious Southeast Asian Recipes Booksumo Press 2024 Scribd Download
100% (7)
Southeast Asian Cooking Learn Easy Southeast Asian Cooking With Delicious Southeast Asian Recipes Booksumo Press 2024 Scribd Download
52 pages
Polycystic Kidney Disease Pathophysiology
No ratings yet
Polycystic Kidney Disease Pathophysiology
2 pages

code

Uploaded by

code

Uploaded by

import pandas as pd

# Reload the Excel file

# Recompute group means

# Simplify taxon name

# Prepare data for plotting

# Get top 20 taxa

# Organize plotting order

# Plot without error bars

ax.set_xlabel("LDA SCORE (log10 abundance)", fontsize=12)

You might also like