(68) Experimental Sequence Analysis System

	Machine:     PIM
	Environment: PIMOS
	Language:    KL1
	Source Code: 200 KB
	Documents:   Manual (English/Japanese)


Overview

Extracts common patterns(motifs) of amino acid sequences in some protein categories.

Characteristics

Motif extraction is one of the important problems in genetic information processing. Motifs are common patterns of amino acid sequences in the same protein category, which are conserved in the evolution process and characterize the function/structure of proteins. The system extracts motifs using the MDL principle and genetic algorithms.

Function

As a criterion for motif evaluation, the minimum description length(MDL) principle was adopted. The MDL principle selects a motif with the shortest description length ( = Complexity of motif + Classification error rate). It enables us to compare a simple motif with exceptions and a complex motif without exceptions.

As a motif search method, genetic algorithms (GA) were adopted to deal with enormous search space. GA realizes probabilistic search by applying genetic operations to a population of motif candidates represented by binary strings. The genetic operations consist of crossover, mutation and selection. The MDL principle plays an essential role in selection.

Three kinds of parallelism can be exploited in the system;trial, divide-and-conquer and data parallelism. They enable us to utilize the Parallel Inference Machine (PIM) effectively.

Bibliography

  1. Konagaya, Stochastic Motifs, Genome Informatics Workshop II, 1991. (in Japanese)
  2. Konagaya, et al., The MDL Principle and Genetic Information Processing, Proceedings of JSAI, 1991. (in Japanese)
  3. Konagaya, et al., Stochastic Decision Predicate:A Scheme to Represent Motifs, AAAI Workshop, 1991.
  4. Koyanagi el al., Protein Motif Extraction using Multi-PSI, Genome Informatics Workshop II, 1991. (in Japanese)
  5. Yamanishi, et al., Learning Stochastic Motifs from Genetic Sequences, Machine Learning Workshop, 1991.

FTP


www-admin@icot.or.jp