Algorithms and programming languages for bioinformatics - ALGORITMI PER BIOINFORMATICA (2010/2011)

Course not running

Course code
4S000525
Name of lecturer
Alberto Castellini
Number of ECTS credits allocated
6
Academic sector
INF/01 - INFORMATICS
Language of instruction
Italian
Location
VERONA
Period
I semestre dal Oct 4, 2010 al Jan 31, 2011.
Web page
http://www.albertocastellini.tk/teaching.html

To show the organization of the course that includes this module, follow this link * Course organization

Lesson timetable

Learning outcomes

The aim of this course is to provide tools for dealing with some typical problems in bioinformatics, such as the analysis of biological data, the representation of biological systems by suitable models and the simulation of such systems. Mathematical and statistical tools for bioinformatics will be examined along with the Java EE platform, which enables to develop web services and web application, a new generation of bioinformatic tools. The analysis of some case studies and laboratory classes will enable to understand how methodologies presented during the course can be used in practice.

Syllabus

OUTLINES OF DESCRIPTIVE AND INFERENTIAL STATISTICS, AND DATAMINING FOR BIOINFORMATICS

Descriptive statistics, hypothesis testing, correlation, linear and multiple regression, method of least squares, variable selection, residual analysis, polynomial regression, nonlinear models and transformations, stepwise regression, time series analysis, trend analysis, the ratio-to-moving-average method, exponential smoothing methods and k-nearest neighbor for regression, autocorrelation.

BIO-STATISTIC TOOLS AND LIBRARIES

Introduction to some of the main software for statistical analysis: SPSS, SAS JMP, STATA, R, Weka, Excel/Calc, Matlab. Main functionalities, fields of applications and comparison of their features. Exercises on some of the statistical operators introduced in the previous lessons (OpenOffice Calc, Matlab).

ALGORITHMS AND SERVICES FOR GENOMIC ANALYSIS

Dynamic programming and problem of longest path in DAG, DNA and protein alignment, Hamming distance, edit distance, edit graphs, longest common sequence algorithm, global alignment and Needleman-Wunsch algorithm, scoring matrices (PAM, BLOSUM), local alignment and Smith-Waterman algorithm, affine gap penalties, alignment scores and statistical significance (E-score and P-score, hints), filtration, FASTA (hints) and Dot Matrices, BLAST (hints), multiple alignment, progressive alignment, Clustal algorithm (hints).

TOOLS FOR REPRESENTATION, SIMULATION AND ANALYSIS OF BIOLOGICAL SYSTEMS AND RELATED DYNAMICS

Membrane computing and P systems, MP systems and MP graphs, synthesis of flux regulation functions from observations and data analysis pipeline, MetaPlab virtual laboratory (plugin architecture, MPStore data structure, plugin implementation in Java)

WEB SERVICES FOR BIOINFORMATICS AND BIOMEDICINE

Introduction to web services: principles of functioning. Web services in development of bioinformatics software, Java EE platform and web applications, development of simple web services for bioinformatics. Case studies about web service specification and engineering. The InfoGenomics project.

Assessment methods and criteria

The exam consists of a project and an oral test. The project concerns the implementation or the application of some of the techniques explained during the course or some of their extensions. The oral test concerns the topics presented during the course.

Share