BioGem Blog

Posts

Compiling and Creating locBLAST Image using Docker

January 24, 2026

locBLAST is a PHP library that provides a graphical user interface (GUI) for the command-line NCBI BLAST+ programs. The official Docker image of locBLAST is available on Docker Hub.

💿 Using NCBI Docker Images for Web BLAST

The most straightforward way to run a web-based BLAST service with Docker is to use a pre-built image provided by the NCBI:

NCBI BLAST+ Command Line Tools: The NCBI provides official Docker images for the standalone command-line BLAST+ suite, which can be found on their GitHub page and Docker Hub.
Latest version: The ncbi/blast:latest tag generally points to the most recent stable release.
Specific versions: You can specify a particular version, e.g., ncbi/blast:2.14.1, to ensure reproducibility.

🛠️ Setting up locBLAST in a Docker Environment

To use locBLAST, you need to deploy it within a web server environment that also has access to the NCBI BLAST+ binaries. locBLAST requires a web serv...
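As a rough illustration of pinning a specific ncbi/blast tag for reproducibility, the sketch below assembles a docker run command line in Python. The helper name blast_docker_argv, the host directory layout (blastdb, queries, results under one root), and the container mount points under /blast are assumptions made for this example, not part of locBLAST.

```python
import shlex


def blast_docker_argv(program, query_file, db_name, out_file, host_root, tag="2.14.1"):
    """Build the argument list for running a BLAST+ program in the ncbi/blast image.

    Assumed (hypothetical) host layout: host_root/blastdb, host_root/queries,
    host_root/results. The container paths under /blast are also assumptions.
    """
    image = f"ncbi/blast:{tag}"  # a pinned tag keeps runs reproducible
    return [
        "docker", "run", "--rm",
        # Databases and queries are mounted read-only, results read-write.
        "-v", f"{host_root}/blastdb:/blast/blastdb:ro",
        "-v", f"{host_root}/queries:/blast/queries:ro",
        "-v", f"{host_root}/results:/blast/results:rw",
        image,
        program,
        "-query", f"/blast/queries/{query_file}",
        "-db", f"/blast/blastdb/{db_name}",
        "-out", f"/blast/results/{out_file}",
    ]


if __name__ == "__main__":
    argv = blast_docker_argv("blastp", "query.fasta", "mydb", "hits.txt", "/srv/blast")
    # Print a shell-safe version of the command for inspection before running it.
    print(shlex.join(argv))
```

The list form can be passed directly to subprocess.run, which avoids shell quoting issues when file names contain spaces.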

3D Protein Structure Prediction Server AlphaFold 3

January 21, 2026

The AlphaFold Server is a free, web-based platform launched by Google DeepMind and Isomorphic Labs to provide the scientific community with access to AlphaFold 3. While the original AlphaFold was revolutionary for predicting protein structures, the AlphaFold 3 server expands this capability to virtually all “life’s molecules,” allowing researchers to model how proteins interact with DNA, RNA, ligands, and more in a single, unified system.

🧬 Key Capabilities

Unlike its predecessors, the AlphaFold 3 server is a multimodal model. It doesn't just fold proteins; it predicts the joint 3D structure of complex molecular assemblies.

Protein-Ligand Interactions: Models how small molecules (drugs) bind to proteins, with a reported 50% improvement over traditional docking methods.
Nucleic Acids: Predicts the structures of DNA and RNA and how they interact with proteins (e.g., transcription factors or CRISPR complexes).
Chemical Modifications: Pr...

Bioinformatics Protocol for NGS Data Analysis

October 12, 2025

A step-by-step bioinformatics protocol for Next-Generation Sequencing (NGS) data analysis consists of (1) data quality control, using tools like FastQC to assess the raw data; (2) data preprocessing, with adapter trimming and low-quality base removal using tools like Trimmomatic or FastP; (3) read mapping to a reference genome using aligners such as BWA or Bowtie2; (4) post-alignment processing, including duplicate removal with Picard and variant calling with GATK or Samtools; and (5) downstream analysis and visualization for specific applications, such as differential gene expression or variant interpretation, using tools like R packages or IGV. A more detailed breakdown of these steps is given below.

1. Data Quality Control (QC)

Purpose: To check the quality of the raw sequencing reads and identify any potential issues.
Tools: FastQC: A widely used tool to generate quality control reports for raw sequencing data.
Output: A report summarizing metrics like Phred scores, adapter contamination, and sequence qu...
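To make the Phred scores mentioned in the QC step concrete, here is a minimal Python sketch that decodes a FASTQ quality string and converts scores to error probabilities using Q = -10 log10(P). It assumes the common Phred+33 ASCII encoding; the function names are chosen for this example and are not part of FastQC.

```python
def phred_scores(quality_string, offset=33):
    """Decode a FASTQ quality string into integer Phred scores.

    Assumes Phred+33 encoding by default (ASCII code minus 33).
    """
    return [ord(ch) - offset for ch in quality_string]


def error_probability(q):
    """Convert a Phred score Q into a base-call error probability P = 10^(-Q/10)."""
    return 10 ** (-q / 10)


def mean_quality(quality_string, offset=33):
    """Return the arithmetic mean Phred score of one read."""
    scores = phred_scores(quality_string, offset)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    quality = "IIII5555"  # 'I' decodes to Q40, '5' decodes to Q20
    print(phred_scores(quality))
    print(mean_quality(quality))
    print(error_probability(20))
```

A score of Q20 corresponds to a 1 in 100 chance that a base call is wrong, and Q30 to 1 in 1000, which is why trimming tools are often configured around thresholds in that range.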

Creating 2D Line Plot using GNU Plot Software

July 16, 2025

This post presents a simple Gnuplot script that generates a customized 2D line plot of potential energy against the number of simulation steps, using data from a file named PE.txt (available on GitHub). The script produces a clean, stylized plot with (1) custom fonts, colors, and line styles, (2) no legend, (3) margins around the x-axis, and (4) automatic y-axis scaling.

Source Code

set title '{/Times-New-Roman=14:Bold Potential Energy Plot}' tc rgb '#167116'
set xlabel '{/Arial:Italic Number of steps}' tc rgb 'red'
set ylabel '{/Arial:Italic Potential energy}' tc rgb 'red'
set style line 1 lt 1 lc rgb '#f70453' lw 0.5
set grid layerdefault lt 0 lc rgb 'blue' lw 0.5
set border lt 1 lc rgb 'blue' lw 1
unset key
plot 'PE.txt' with lines ls 1
set xrang...

Molecular Dynamics Simulation of Micromolecules using Chimera

July 07, 2025

Performing a Molecular Dynamics (MD) simulation of a small molecule in UCSF Chimera involves a series of steps: preparing the molecule, setting up the simulation environment, running the simulation, and finally analyzing the resulting trajectory. Here is a step-by-step guide:

1. Loading and Preparing the Small Molecule Structure

Open Chimera: Launch UCSF Chimera or ChimeraX.
Load your molecule: Import your small molecule structure into Chimera using File > Open, or File > Fetch by ID if the structure is available in a database like the Protein Data Bank (PDB).
Add Hydrogens: Use the "Molecular Dynamics Simulation" tool's "Prep Structure" section to add hydrogens. You might also be able to use the addh command.
Assign Force Field Parameters: Since you are working with a small molecule (a nonstandard residue), you will use Amber's Antechamber module, which is included in Chimera, to assign force field parameters. This involves ass...

Constructing Phylogenetic Tree using UPGMA Method

June 17, 2025

UPGMA (Unweighted Pair Group Method with Arithmetic Mean) is a distance-based method for constructing phylogenetic trees. It works by iteratively merging the two closest clusters of sequences into a new cluster until all sequences are grouped into a single tree. The distance between two clusters is the average of all pairwise distances between the sequences in one cluster and the sequences in the other. UPGMA produces rooted trees, meaning each tree has a defined root representing the common ancestor. Here's a more detailed explanation:

1. Distance Matrix

UPGMA begins with a distance matrix, which contains the pairwise distances between all sequences being compared. These distances can be based on sequence alignment, protein structure comparisons, or other relevant metrics.

\[D_{i,j}=\max\begin{cases}D_{i-1,j-1} & + & s(a_i,b_j) \\D_{i-1,j} & + & s(a_i,-) \\D_{i,j-1} & + & s(-,b_j)\end{cases}=\max\begin{cases}D_{i-1,j-1}& + ...
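The clustering loop described above can be sketched in a few lines of Python. This is a minimal illustration, not the post's own implementation: the function name upgma, the square-matrix input, and the Newick output format are choices made for this example. Each node is placed at half the distance between the two clusters it joins, and distances to a merged cluster are size-weighted averages, which is equivalent to averaging over all pairwise sequence distances.

```python
def upgma(labels, matrix):
    """Build a rooted UPGMA tree and return it as a Newick string.

    labels: list of sequence names.
    matrix: symmetric square matrix (list of lists) of pairwise distances.
    """
    n = len(labels)
    # Active clusters: id -> (newick text, number of leaves, node height).
    clusters = {i: (labels[i], 1, 0.0) for i in range(n)}
    # Distances between active clusters, keyed by (smaller id, larger id).
    dist = {(i, j): float(matrix[i][j]) for i in range(n) for j in range(i + 1, n)}
    next_id = n

    while len(clusters) > 1:
        # Pick the closest pair of clusters.
        (a, b), d_ab = min(dist.items(), key=lambda kv: kv[1])
        newick_a, size_a, height_a = clusters.pop(a)
        newick_b, size_b, height_b = clusters.pop(b)

        # The new node sits at half the inter-cluster distance.
        height = d_ab / 2
        newick = (f"({newick_a}:{height - height_a:g},"
                  f"{newick_b}:{height - height_b:g})")

        # Size-weighted average distance from the merged cluster to the rest.
        new_dist = {}
        for c in clusters:
            d_ac = dist[(min(a, c), max(a, c))]
            d_bc = dist[(min(b, c), max(b, c))]
            new_dist[(c, next_id)] = (size_a * d_ac + size_b * d_bc) / (size_a + size_b)

        # Drop distances that involve the two merged clusters.
        dist = {k: v for k, v in dist.items() if a not in k and b not in k}
        dist.update(new_dist)
        clusters[next_id] = (newick, size_a + size_b, height)
        next_id += 1

    (newick, _, _), = clusters.values()
    return newick + ";"


if __name__ == "__main__":
    names = ["A", "B", "C"]
    distances = [
        [0, 2, 6],
        [2, 0, 6],
        [6, 6, 0],
    ]
    print(upgma(names, distances))  # (C:3,(A:1,B:1):2);
```

In the small example, A and B are merged first at height 1, and the resulting cluster joins C at height 3, so every leaf ends up at the same distance from the root, which is the ultrametric property UPGMA assumes.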
