Ian Korf's Home Page


You have reached the home page of Ian Korf. I am one of the new faculty at the UC Davis Genome Center.

Contact me via email: ifkorf@ucdavis.edu


Research Interests

My computational molecular biology research seeks to understand structure and function in genomic DNA. Since my research involves methodolgies from text processing (exact and inexact string matching) and speech recognition (hidden Markov models), and because comparative genomics provides a Rosetta Stone-like guidance, my research is very much like reading the book of life.
Gene Prediction
Despite roughly 20 years of research, gene prediction algorithms cannot accurately derive a proteome from a genome. I'm trying to bring a biologists perspective to the field. My latest effort is called SNAP.
Comparative Genomics
Just as the ancient Greeks used comparative anatomy to understand the human body, I'm using comparative genomics to understand the human genome (and other genomes).
Genome Annotation
Genome annotation seeks to label regions of a genome with functional descriptors like "gene" or "repeat". Identifying the complete set of human genes is probably the most obvious goal of genome annotation today. Annotation often relies on expert biologists and expert systems to determine the correct labeling of a sequence. I'm interested in improving both manual and automated annotation and identifying standards in this new field.
Developmental Regulation
My formal training is in molecular and developmental biology and I am still very interested in this area. In a sense, all my interests are fueled by a curiosity about how genes are regulated in space, time, and clade.

Publications

Cool Stuff

Software Packages

  • SNAP gene prediction program and some standard data sets of genes.
  • MaskerAid Make RepeatMasker fly!
  • Twinscan Genscan-like gene predictor using genomic homology.
  • PolyBayes A Bayesian approach to identifying SNPs from multiple alignments.
  • MyGenBank Manage a local copy of GenBank with MySQL.
  • AHA Flexible sequence analysis pipeline that exports a portable ACEDB instance.

Perl Freebies

The software here is unsupported, free software. Please report bugs, but don't expect timely updates.
  • DataBrowser.pm Survival tool for when you're knee-deep in Perl data structures.
  • BPlite.pm Simple BLAST parser with a clean, object-oriented interface.
  • FAlite.pm Convenient interface for parsing FASTA files.
  • GBlite.pm Parses GenBank flat files. Used by MyGenBank.
  • codon.pl Ever wonder how many unambiguous translations there are for ambiguous codons?
  • mpblast.pl Make BLASTN 10x faster on batch jobs of short sequences.
  • plotBlast.pl Makes graphical BLAST reports.
  • tregex.pl Search for protein patterns in EST databases.
  • xblast.pl Perl version of Claverie's xblast.