|
Abstract
Here, we propose a model to predict transcript production rates or levels from genes' promoter occupancy profiles, and apply it to two types of problems. The promoter occupancy profile is produced by COMPETE, a software package implementing a model of continuous DNA binding that considers competition by diverse DBFs at given concentrations, and capable of doing so at genome-wide scale and single-nucleotide resolution.
Our model takes promoter sequences and DBF concentrations as inputs, generating transcript production rates or levels as outputs. Thereby, there are two types of problems. One studies the response of transcript production as the TF concentrations change over time, fixing promoter sequence and nucleosome concentration as other inputs. The other is about transcript production rate/level being being up or down, after mutations are executed on promoters. For the first type, the model is applied in the S. cerevisiae yeast to predict the instantaneous transcript production rate during the yeast cell cycle for each gene, given its promoter occupancy profile. For the second type, we apply the model in S. cerevisiae yeast to predict the averaged transcript production rate of mutant promoters and new promoters which have not been seen before in training set.