Skip to main content
NC State Home

Sharma Thankachan

ST

Associate Professor

3278 Engineering Building II (EB2)

919-513-0816 Website

Bio

Sharma V. Thankachan is an associate professor in the Department of Computer Science at NC State University. His research focuses on string algorithms, compressed data structures, and their applications in areas such as computational biology.

Prior to joining NC State in 2022, Thankachan was an assistant professor at the University of Central Florida. He completed postdoctoral research at the Georgia Institute of Technology and the University of Waterloo. He earned his Ph.D. in computer science from Louisiana State University and a B.Tech. in electrical and electronics engineering from the National Institute of Technology Calicut, India.

Thankachan has advised several doctoral students, including:

  • Mano Prakash Parthasarathi

  • Paul Macnichol

  • Oliver Chubet (co-advised with Donald Sheehy)

His doctoral alumni include:

  • Dr. Paniz Abedin, Assistant Professor of Computer Science at Florida Polytechnic University

  • Dr. Daniel Gibney, Assistant Professor of Computer Science at The University of Texas at Dallas

  • Dr. Sahar Hooshmand, Assistant Professor of Computer Science at California State University, Dominguez Hills

Education

Ph.D. Computer Science Louisiana State University, Baton Rouge 2014

B.Tech Electrical and Electronics Engineering National Institute of Technology, Calicut, India 2006

Area(s) of Expertise

Algorithms and Theory of Computation

Publications

View all publications

Grants

Date: 01/01/23 - 4/30/27
Amount: $513,686.00
Funding Agencies: National Science Foundation (NSF)

This project aims to address the following question: How to model the combined information of a pan-genome collection succinctly (and in a biologically meaningful way) such that the genomic analysis on that representation is both easy-to-compute and accurate? Pan-genome collections may be represented as high-scoring Multiple Sequence Alignment (MSA) data, indexed text data, or the more popular graph-based representations (pan-genome graphs). These models need to support read mapping queries efficiently. This research will lead to a new class of string/graph algorithms for the analysis of pan-genomic data.

Date: 03/01/23 - 1/31/27
Amount: $414,034.00
Funding Agencies: National Science Foundation (NSF)

Being able to store, search, and analyze massive data sets efficiently is one of today's pressing challenges. This project will study a collection of problems under text compression and indexing with tremendous current relevance, owing to a specific characteristic prevalent in many modern text data sets, called high repetitiveness. This characteristic makes the data highly compressible using some specialized schemes. However, the theoretical understanding of those schemes is still in a nascent stage. We will address some of the fundamental open problems on the effectiveness of several schemes that are popular in practice


View all grants
  • National Science Foundation Faculty Early CAREER Development Award - 2022