AMBER and Molecular Simulation File Formats
This document provides an overview of common file formats used in AMBER-based molecular modeling and molecular dynamics workflows. It describes each file’s format, purpose, and typical producer–consumer relationships within simulation pipelines.
Overview Table
The following table summarizes the most commonly encountered file formats.
File name / extension |
File type / format |
Purpose / role |
Producer → Consumer |
|---|---|---|---|
|
Text (Tripos MOL2) |
Stores atom types, bonding, residue definitions, and partial charges for small molecules or nonstandard residues; primary input for parameterization workflows. |
External QM tools / Antechamber → Antechamber, |
|
Text (AMBER force-field modification) |
Provides missing or custom force-field parameters (bond, angle, dihedral, van der Waals) not present in standard AMBER force fields. |
Antechamber / |
|
Text (AMBER library/object) |
Defines complete residues or molecules including atom names, types, charges, and connectivity for system construction. |
|
|
Text (AMBER prep format) |
Legacy residue definition format similar to |
Antechamber / user → |
|
Text (PDB with charges) |
Human-readable mapping of partial charges onto atomic coordinates for validation, inspection, or visualization. |
Antechamber / user → visualization tools |
|
Text (Protein Data Bank) |
Stores atomic coordinates and connectivity for biomolecules or complexes; often used as starting structural input. |
Experimental data / modeling tools → |
|
Text (AMBER coordinate format) |
Stores initial atomic coordinates and box information for simulations;
often paired with |
|
|
Text (AMBER topology) |
Contains complete force-field topology: atom types, charges, bonded and nonbonded parameters; essential for all simulations. |
|
|
Text or binary (AMBER restart) |
Stores simulation state (coordinates, optionally velocities and box) to restart or continue simulations. |
|
|
Text (AMBER trajectory) |
Stores time series of atomic coordinates during MD; legacy trajectory format. |
|
|
Binary (NetCDF trajectory) |
Efficient, portable trajectory format storing coordinates, velocities, and/or forces; preferred modern trajectory format. |
|
|
Text (AMBER input control) |
Specifies simulation parameters, run type, restraints, and algorithms for MD or minimization. |
User → |
|
Text (AMBER output log) |
Records simulation progress, energies, temperatures, and warnings for monitoring and diagnostics. |
|
|
Text or binary (energy output) |
Stores per-step energy components for post-processing and analysis. |
|
|
Binary (compressed restart) |
Binary restart format optimized for performance in large simulations. |
|
|
Text (topology alias) |
Alternative naming for AMBER topology files, functionally identical
to |
|
|
Text (coordinate alias) |
Legacy coordinate file naming; often synonymous with |
|
|
Text (cpptraj script) |
Defines analysis operations (RMSD, clustering, free energy post-processing) for trajectory analysis. |
User → |
Notes on Usage
Topology + coordinates are always required together for simulations (e.g.,
.prmtop+.inpcrdor.rst7).NetCDF (``.nc``) trajectories are strongly recommended over legacy
.mdcrdfor performance and portability.Legacy formats such as
.prepand.crdare retained mainly for backward compatibility.Human-readable intermediates (e.g.,
_charges.pdb) are useful for validation but are not used directly in production simulations.