FTP Services

FTP Access

Searches and reports performed on this RCSB PDB website utilize data from the PDB archive. The PDB archive is maintained by the wwPDB at ftp.wwpdb.org (data download details).

Note: Users should switch to binary mode before downloading data files.

Major Directories

The directory pub/pdb is the entry directory for the ftp site.

The directory pub/pdb/data/structures/divided contains the current PDB contents including PDB, mmCIF, and PDBML/XML formatted coordinate files, structure factors and NMR restraints:

Further details

Automated Download of Data from the PDB FTP Archive

The RCSB PDB also provides some example scripts to assist in the automated download of data from the ftp site.

Additional information on obtaining and maintaining copies of the entire PDB archive or certain portions of it is available at http://www.wwpdb.org/downloads.html.

Additional RCSB PDB FTP services

A supplemental ftp archive is solely maintained by the RCSB PDB at ftp://resources.rcsb.org.

This table summarizes the contents of ftp://resources.rcsb.org, a supplemental ftp site maintained solely by the RCSB PDB. Clicking on a directory or file name will open that content.

Directory or File Contents
/sequence/clusters/bc-30.out
/sequence/clusters/bc-40.out
/sequence/clusters/bc-50.out
/sequence/clusters/bc-70.out
/sequence/clusters/bc-90.out
/sequence/clusters/bc-95.out
/sequence/clusters/bc-100.out
Results of the weekly clustering of protein chains in the PDB by BLASTClust at 30%, 40%, 50%, 70%, 90%, 95%, and 100% sequence identity. For more information, see Redundancy in the Protein Data Bank.
/sequence/clusters/clusters50.txt
/sequence/clusters/clusters70.txt
/sequence/clusters/clusters90.txt
/sequence/clusters/clusters95.txt
Results of the weekly clustering of protein chains in the PDB by cd-hit at 50%, 70%, 90%, and 95% sequence identity.
/sequence/clusters/not_in_clusters.txt Contains nucleic acid chains and short polypeptides of fewer than 20 amino acids, which are not clustered.
/files/split_biol_assembly.txt List of split entries (structures split across multiple PDB files) and their biological assemblies
/fatcat_rigid_pdb_all/fatcat_rigid_pdb_all.txt.gz Data for all vs. all structure alignments for the full PDB archive, as described in Prlic et al Bioinformatics 2010
/protmod/protmod.tsv.gz Protein modification data for the full PDB archive, as described in Gao et al Bioinformatics 2017
Last Updated: April 13, 2010

Obtaining Files that used to be in the Brookhaven PDB FTP Archive

The RCSB PDB no longer maintains an up-to-date copy of the BNL PDB FTP archive. Please click here for more information.