|
Models, MolecularLatest Report:
ProteinSplit is a rapid method designed to predict ordered and disordered regions in a protein sequence in order to locate domain boundaries. Our algorithms can be used in a typical high throughput modeling environment in large-scale genome annotation projects. A consensus algorithm utilized by ProteinSplit allows for fast scanning of a given protein sequence dataset. Our implementation of the method provides also a single protein mode for detailed, manual analysis and a semi-automated mode with multiple user access for mid-scale projects. Acknowledgements This work has been supported by MEiN and the European Commission with grants LSHG-CT-2004-512035 and LSHG-CT-2003-503265 and MEiN with grant 2P05A00130. LSW is a fellow of the Young Scientist Award from The Foundation for Polish Science. Authors: Wyrwicz LS, Koczyk G, Rychlewski L, Plewczynski D
Latest Paper:
Medical Research Council, Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK.
Designer amino acids, beyond the canonical 20 that are normally used by cells, can now be site-specifically encoded into proteins in cells and organisms. This is achieved using 'orthogonal' aminoacyl-tRNA synthetase-tRNA pairs that direct amino acid incorporation in response to an amber stop codon (UAG) placed in a gene of interest. Using this approach, it is now possible to study biology in vitro and in vivo with an increased level of molecular precision. This has allowed new biological insights into protein conformational changes, protein interactions, elementary processes in signal transduction and the role of post-translational modifications.
Most cited papers:
Department of Molecular Biology, BMC, Uppsala, Sweden.
Map interpretation remains a critical step in solving the structure of a macromolecule. Errors introduced at this early stage may persist throughout crystallographic refinement and result in an incorrect structure. The normally quoted crystallographic residual is often a poor description for the quality of the model. Strategies and tools are described that help to alleviate this problem. These simplify the model-building process, quantify the goodness of fit of the model on a per-residue basis and locate possible errors in peptide and side-chain conformations.
Alex Bateman,
Lachlan Coin,
Richard Durbin,
Robert D Finn,
Volker Hollich,
Sam Griffiths-Jones,
Ajay Khanna,
Mhairi Marshall,
Simon Moxon,
Erik L L Sonnhammer,
David J Studholme,
Corin Yeats,
Sean R Eddy
Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.
Pfam is a large collection of protein families and domains. Over the past 2 years the number of families in Pfam has doubled and now stands at 6190 (version 10.0). Methodology improvements for searching the Pfam collection locally as well as via the web are described. Other recent innovations include modelling of discontinuous domains allowing Pfam domain definitions to be closer to those found in structure databases. Pfam is available on the web in the UK (http://www.sanger.ac.uk/Software/Pfam/), the USA (http://pfam.wustl.edu/), France (http://pfam.jouy.inra.fr/) and Sweden (http://Pfam.cgb.ki.se/).
Eric F Pettersen,
Thomas D Goddard,
Conrad C Huang,
Gregory S Couch,
Daniel M Greenblatt,
Elaine C Meng,
Thomas E Ferrin
Computer Graphics Laboratory, Department of Pharmaceutical Chemistry, University of California, 600 16th Street, San Francisco, California 94143-2240, USA.
The design, implementation, and capabilities of an extensible visualization system, UCSF Chimera, are discussed. Chimera is segmented into a core that provides basic services and visualization, and extensions that provide most higher level functionality. This architecture ensures that the extension mechanism satisfies the demands of outside developers who wish to incorporate new features. Two unusual extensions are presented: Multiscale, which adds the ability to visualize large-scale molecular assemblies such as viral coats, and Collaboratory, which allows researchers to share a Chimera session interactively despite being at separate locales. Other extensions include Multalign Viewer, for showing multiple sequence alignments and associated structures; ViewDock, for screening docked ligand orientations; Movie, for replaying molecular dynamics trajectories; and Volume Viewer, for display and analysis of volumetric data. A discussion of the usage of Chimera in real-world situations is given, along with anticipated future directions. Chimera includes full user documentation, is free to academic and nonprofit users, and is available for Microsoft Windows, Linux, Apple Mac OS X, SGI IRIX, and HP Tru64 Unix from http://www.cgl.ucsf.edu/chimera/.
York Structural Biology Laboratory, University of York, Heslington, York YO10 5YW, England. emsley@ysbl.york.ac.uk
CCP4mg is a project that aims to provide a general-purpose tool for structural biologists, providing tools for X-ray structure solution, structure comparison and analysis, and publication-quality graphics. The map-fitting tools are available as a stand-alone package, distributed as 'Coot'.
The X-ray crystal structure of the nucleosome core particle of chromatin shows in atomic detail how the histone protein octamer is assembled and how 146 base pairs of DNA are organized into a superhelix around it. Both histone/histone and histone/DNA interactions depend on the histone fold domains and additional, well ordered structure elements extending from this motif. Histone amino-terminal tails pass over and between the gyres of the DNA superhelix to contact neighbouring particles. The lack of uniformity between multiple histone/DNA-binding sites causes the DNA to deviate from ideal superhelix geometry.
Department of Mathematical Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA. zukerm@rpi.edu
The abbreviated name,'mfold web server', describes a number of closely related software applications available on the World Wide Web (WWW) for the prediction of the secondary structure of single stranded nucleic acids. The objective of this web server is to provide easy access to RNA and DNA folding and hybridization software to the scientific community at large. By making use of universally available web GUIs (Graphical User Interfaces), the server circumvents the problem of portability of this software. Detailed output, in the form of structure plots with or without reliability information, single strand frequency plots and 'energy dot plots', are available for the folding of single sequences. A variety of 'bulk' servers give less information, but in a shorter time and for up to hundreds of sequences at once. The portal for the mfold web server is http://www.bioinfo.rpi.edu/applications/mfold. This URL will be referred to as 'MFOLDROOT'.
European Molecular Biology Laboratory, Heidelberg, Federal Republic of Germany.
With a rapidly growing pool of known tertiary structures, the importance of protein structure comparison parallels that of sequence alignment. We have developed a novel algorithm (DALI) for optimal pairwise alignment of protein structures. The three-dimensional co-ordinates of each protein are used to calculate residue-residue (C alpha-C alpha) distance matrices. The distance matrices are first decomposed into elementary contact patterns, e.g. hexapeptide-hexapeptide submatrices. Then, similar contact patterns in the two matrices are paired and combined into larger consistent sets of pairs. A Monte Carlo procedure is used to optimize a similarity score defined in terms of equivalent intramolecular distances. Several alignments are optimized in parallel, leading to simultaneous detection of the best, second-best and so on solutions. The method allows sequence gaps of any length, reversal of chain direction and free topological connectivity of aligned segments. Sequential connectivity can be imposed as an option. The method is fully automatic and identifies structural resemblances and common structural cores accurately and sensitively, even in the presence of geometrical distortions. An all-against-all alignment of over 200 representative protein structures results in an objective classification of known three-dimensional folds in agreement with visual classifications. Unexpected topological similarities of biological interest have been detected, e.g. between the bacterial toxin colicin A and globins, and between the eukaryotic POU-specific DNA-binding domain and the bacterial lambda repressor.
Geneva Biomedical Research Institute, Glaxo Wellcome Research and Development, Plan-les-Ouates/Geneva, Switzerland.
Comparative protein modeling is increasingly gaining interest since it is of great assistance during the rational design of mutagenesis experiments. The availability of this method, and the resulting models, has however been restricted by the availability of expensive computer hardware and software. To overcome these limitations, we have developed an environment for comparative protein modeling that consists of SWISS-MODEL, a server for automated comparative protein modeling and of the SWISS-PdbViewer, a sequence to structure workbench. The Swiss-PdbViewer not only acts as a client for SWISS-MODEL, but also provides a large selection of structure analysis and display tools. In addition, we provide the SWISS-MODEL Repository, a database containing more than 3500 automatically generated protein models. By making such tools freely available to the scientific community, we hope to increase the use of protein structures and models in the process of experiment design.
Syngenta Agricultural Discovery Institute, 3115 Merryfield Row, Suite 100, San Diego, CA 92121, USA.
We describe a largely unbiased method for rapid and large-scale proteome analysis by multidimensional liquid chromatography, tandem mass spectrometry, and database searching by the SEQUEST algorithm, named multidimensional protein identification technology (MudPIT). MudPIT was applied to the proteome of the Saccharomyces cerevisiae strain BJ5460 grown to mid-log phase and yielded the largest proteome analysis to date. A total of 1,484 proteins were detected and identified. Categorization of these hits demonstrated the ability of this technology to detect and identify proteins rarely seen in proteome analysis, including low-abundance proteins like transcription factors and protein kinases. Furthermore, we identified 131 proteins with three or more predicted transmembrane domains, which allowed us to map the soluble domains of many of the integral membrane proteins. MudPIT is useful for proteome analysis and may be specifically applied to integral membrane proteins to obtain detailed biochemical information on this unwieldy class of proteins.
Department of Embryology, Carnegie Institution of Washington, Baltimore, MD 21210.
A 30-amino-acid segment of C/EBP, a newly discovered enhancer binding protein, shares notable sequence similarity with a segment of the cellular Myc transforming protein. Display of these respective amino acid sequences on an idealized alpha helix revealed a periodic repetition of leucine residues at every seventh position over a distance covering eight helical turns. The periodic array of at least four leucines was also noted in the sequences of the Fos and Jun transforming proteins, as well as that of the yeast gene regulatory protein, GCN4. The polypeptide segments containing these periodic arrays of leucine residues are proposed to exist in an alpha-helical conformation, and the leucine side chains extending from one alpha helix interdigitate with those displayed from a similar alpha helix of a second polypeptide, facilitating dimerization. This hypothetical structure is referred to as the "leucine zipper," and it may represent a characteristic property of a new category of DNA binding proteins.
|
Polish News |
|
||
|
|