Transcription factors are proteins that bind to short sequence motifs on DNA typically called cisregulatory
elements. These cis-regulatory elements are characterized by their linear base pair sequence as
well as specific features of their three-dimensional structure. These structural features play an important role
in the recognition and binding of proteins to DNA.
Various computational approaches have been used to model protein-DNA interaction interfaces at an
atomic level. We describe their most promising scope of applications and discuss their assets and drawbacks.
Structure-based computational methods require three-dimensional protein-DNA complexes gained either by
X-ray crystallography or by in silico modeling. Docking approaches, molecular dynamics, and Monte Carlo
simulations are promising techniques to model transcription factor-DNA complexes in silico. Both experimentally
determined and ab initio designed protein-DNA complexes can be analyzed by statistical methods.
We describe the differences of several statistical potentials and how they were obtained. Position weight
matrices obtained from structure-based approaches can then be used to scan efficiently and more accurately
genome-wide for transcription factor binding sites.
In a case study on WRKY-DNA complexes we present a computational modeling technique for the ab
initio design of a specific transcription factor-DNA complex. This procedure is generally applicable to similar
problems. The resulting three-dimensional interaction interface provides the basis for studying specific side
chain and base interactions. Moreover, the results give hints towards varying specificity and function of
different representatives of the WRKY protein family. This study provides valuable insights into the interplay
between transcription factors and DNA in three dimensions and opens up new perspectives for their design.
Keywords: three-dimensional structure, 3D, in silico modelling, WRKY, transcription factors, DNA binding domain, DNAprotein
interface, DNA-protein interacion, ab initio design, Monte Carlo simulations, molecular dynamics, docking, forcefield
docking, plant transcription factors, W-box, position specific scoring matricies, simulations.