Bioinformatics of carbonic anhydrases
The focus of this research consists of bioinformatic studies of carbonic anhydrases (CA), ubiquitous enzymes which are found in all kingdoms of life, focusing on CAs in animals. The ultimate goal is to increase our understanding of how and why various CA isoforms and CAs in different species are different from each other. This would be useful in understanding drug specificity, and in the design of novel, improved forms of CAs. An intermediate goal in this process is the discovery of most likely evolutionary histories of CAs.
The project is run by Martti Tolvanen in collaboration with the group of Seppo Parkkila (University of Tampere).
Conserved domain and evolution of secreted phospholipases A(2)
Secreted phospholipases A(2) (sPLA(2) s) are lipolytic enzymes present in organisms ranging from prokaryotes to eukaryotes but their origin and emergence are poorly understood. We study conserved domains of sPLA(2)
and proposed a model for their evolution.
Protein hydropathy predictions
We investigated the prediction accuracy of 56 hydropathy scales by correlating predicted values with the accessible surface area in known protein structures. Results for different amino acids vary greatly within each scale. We also investigated prediction accuracies of amino acids separately in secondary structural elements and in protein fold families.
Gene expression on the theory of phase synchronization
Using phase synchronization it is possible to detect biological
associations for gene pairs with cell cycle-specific expression profiles. Phase-synchronization clustering is able to detect biologically associated gene pairs that have linearly correlated (simultaneous and inverted) as well as time-delayed expression profiles.
Tools for the submission of mutations to databases and maintenance of locus-specific mutation databases. Advanced, integrated computer systems are needed to store and organize the increasing mutation information.
Turku BioNLP Group
The main focus of the BioNLP group is the development of text mining algorithms and their utilization in knowledge discovery in biology and medicine. The flagship resources developed by the group are the TEES text mining system which won several text mining competitions, and the EVEX resource comprising detailed information about genes, proteins, and their mutual relationships mined from the entire publicly available biomedical literature. The BioNLP group has a number of collaborative projects with universities worldwide.
More information about the group’s projects and publications can be found on its homepage.
Research utilizing the Auria biobank data
The Auria biobank is a project to centralize the storage of patient samples and related information from Turku area hospitals and to provide the means for scientists to utilize this large dataset for research purposes. The IT-department is involved in the Auria-project and we will utilize our expertise in machine learning and large scale data mining to apply the biobank data to different research questions.
- A crowdsourced analysis to identify ab initio molecular signatures predictive of susceptibility to viral infection
Slim Fourati, Aarthi Talla, Mehrad Mahmoudian, Joshua G. Burkhart, Riku Klén, Ricardo Henao, Thomas Yu, Zafer Aydın, Ka Yee Yeung, Mehmet Eren Ahsen, Reem Almugbel, Samad Jahandideh, Xiao Liang, Torbjörn E.M. Nordling, Motoki Shiga, Ana Stanescu, Robert Vogel, The Respiratory Viral DREAM Challenge Consortium, Gaurav Pandey, Christopher Chiu, Micah T. McClain, Christopher W. Woods, Geoffrey S. Ginsburg, Laura L. Elo, Ephraim L. Tsalik, Lara M. Mangravite, Solveig K. Sieberts
- Ontology Development for Patient Education Documents Using a Professional- and Patient-Oriented Delphi Method
Juho Heimonen, Riitta Danielsson-Ojala, Tapio Salakoski, Heljä Lundgren-Laine, Sanna Salanterä
- Interpreting Behaviour and Emotions for People with Deafblindness
Riitta Lahtinen, Stina Ojala
- Opening up #jesuisCharlie anatomy of a Twitter discussion with mixed methods
Marjut Johansson, Aki-Juhani Kyröläinen, Filip Ginter, Lotta Lehti, Attila Krizsán, Veronika Laippala
- Wide-scope biomedical named entity recognition and normalization with CRFs, fuzzy matching and character level modeling
Suwisa Kaewphan, Kai Hakala, Niko Miekka, Tapio Salakoski, Filip Ginter
- Dependency profiles in the large-scale analysis of discourse connectives
Veronika Laippala, Aki-Juhani Kyröläinen, Jenna Kanerva, Filip Ginter
- Finding novel relationships with integrated gene-gene association network analysis of Synechocystis sp. PCC 6803 using species-independent text-mining
Sanna M. Kreula, Suwisa Kaewphan, Filip Ginter, Patrik R. Jones
- Improving layman readability of clinical narratives with unsupervised synonym replacement
Moen H., Peltonen L., Koivumäki M., Suhonen H., Salakoski T., Ginter F., Salanterä S.
- Återanvändningen av text i den finska tidningspressen 1771–1853
Salmi Hannu, Nivala Asko, Rantala Heli, Sippola Reetta, Vesanto Aleksi, Ginter Filip
- Opening up #jesuisCharlie. Anatomy of a Twitter discussion with mixed methods
Johansson Marjut, Kyröläinen Aki-Juhani, Ginter Filip, Lehti Lotta, Krizsán Attila, Laippala Veronika