Md Mofijul Islam

Table of Contents

Multi-modal and Multi-task Learning [Grant]
- Transfer Learning in Visual Question Answering
Design Optimization and Evolutionary Approches
Transfer Learning Approach to Extract Fact and Validate Statement
iProtGly-SS: Identifying Protein Lysine Glycation Sites Using Sequence Features
Interpretable Machine Learning
Developer Question Answering and Repository Mining
Bangla Speech and Speakers Identification
Identification of Recombination Hotspot in Genome
Bangla Article Classfication
Big Graph Query and Mining

Representation learning is applied in different areas, such as Computer Vision, Natural Language Processing(NLP), Social Network analysis, etc. Most of the representational learning are applied in solving inference problems by leveraging the unimodal data. However, in recent years, with the increasing computation power, multi-modal learning is applied in developing different inference system. As multi-modal learning brings new information from different modalities, which in turn enable an inference system to embed this new information to learn a particular modality data representation with better confidence. For example, we can jointly learn the visual and textual data representation for Visual Question Answering System.

Outcome: I received Nvidia Academic GPU Grant to conduct this work

Transfer Learning in Visual Question Answering
Primarily, in this project, we want to apply the multi-modal and transfer learning approaches to improve the Visual Question Answering(VQA) system. We developed a multi-modal learning model by utilizing VizWiz Dataset and achieved considerable performance. We have a plan to extend this work to test our approach in other datasets by incorporating transfer learning approach.
Outcome: [Manuscript in Preparation]

Design Optimization and Evolutionary Approaches

The aim of this project is to design optimization and evolutionary approaches to solve NP-Complete problems in various domains, such as resource allocation in cloud computing environment and reduce overfitting issue as well as improve the reasoning process in learning models.

Optimized Resource Allocation in Mobile Cloud Computing
In this project, we utilized two evolutionary approaches, Genetic Algorithm and Particle Swarm Optimization(specifically Ant Colony Optimization) to design two resource allocation schemes in heterogeneous Mobile Cloud Computing(MCC) environment. This meta-heuristics resource allocation approaches helps to minimize the task execution and subsequently increase resource utilization in MCC, which is very crucial for Big data-based resource constrained cloud application, such as mobile and e-health application.
Outcome: IEEE Access 2017,NSysS 2016
Reduce Overfitting Problem in Deep Learning Models
We are currently working with some evolutionary and meta-heuristic local search approaches to reduce the overfitting problem in deep learning problem, as till now most of the effective solution of overfitting problem is not guided approach. Primarily, we are working in designing meta-heuristic dropout approach to reducing the overfitting problem.
Outcome: [Work in Progress]
Optimized Distributed Clustering Model in DSN
We developed a dynamic distributed clustering model to minimizes energy consumption and data collection time in the directional sensor network(DSN) by minimizing the active directional sensor nodes. Our clustering approach is showed effective performance to increase the network lifetime in DSN.
Outcome: EURASIP JWCN 2015,IEEE APWiMob 2014

Transfer Learning Approach to Extract Fact and Validate Statement

In recent years, with the proliferation of social media, the validating statement appears as a crucial problem and attract the attention of NLP research communities. However, due to the lack of any comprehensive dataset, NLP communities could not address this problem effectively. That’s why in this work, we are utilizing the transfer learning approach to develop a fact extraction and checking models with the few available datasets.

Outcome:[Manuscript in Preparation]

iProtGly-SS: Identifying Protein Lysine Glycation Sites Using Sequence Features

Glycation is a chemical reaction by which sugar molecule bonds with a protein without the help of enzymes. This is often caused to many diseases and therefore the knowledge about glycation is very important. In this work, we designed a supervised learning model, iProtGly-SS, to identify protein lysine glycation site based on features extracted from sequence and secondary structural information.

Outcome: Proteins Journal 2018 [Source Code, Data, Web Application]

Interpretable Machine Learning

The goal of this project is to develop applications, which enable people to understand the learning process of learning models. In recent years several complex learning models have been proposed and these models outperform in solving several complex tasks. However, these mostly fails to explain the internal reasoning process, i.e. how their internal structure learns and most importantly which features learns in a particular layer. In this project, we address this problem and develop applications which help people to understand the black-box learning process. Moreover, we are also developing application which may help to debug learning models.

Outcome: [Under Review],[Manuscript in Preparation]

Developer Question Answering and Repository Mining

The aim of this work is to mine the question answering(QA) system data, especially StackOverflow QA data, and other software project repository data in order to design some learning and thus ease the software development process. As a part of this project, we developed an accepted answer recommendation model, which helps to rank the answer in a StackOverflow question by utilizing various textual and meta-features of the question, answer and the comments RAiTA.

Outcome: Springer IEMIS 2018

Bangla Speech and Speakers Identification

Presently, in this work, we are developing a Bangla speech dataset so that researchers can develop various computational learning models, such as Bangla voice recognition, speakers identification etc. Till now, there is no public Bangla speech dataset available for research purpose.

Outcome: Data Collection Application

Identification of Recombination Hotspot in Genome

In this work, we are trying to develop a deep learning in order to extract the sequence-based features and thus identify the recombination hotspot in a genome.

Outcome: [Work in Progress]

Bangla Article Classification

We curated a comprehensive dataset of Bangla news articles, which contains around 400,000 news articles collected from different Bangla News portals. We also develop Bangla articles classification model with semantic textual features, which outperforms state-of-art-works. Moreover, we have also plan to extend this dataset, so that we can address other Bangla NLP research problems.

Outcome: ICBSLP 2018[Dataset and Source Code, Web Application]

Big Graph Query and Mining

We are working to develop algorithms which help to query and mining subgraph in a big graph dataset. Currently, we developed heuristic pruning based subgraph graph query approach to speed up the search and mining approach in the big scholarly dataset. As we are writing the paper, that’s why we can not able to disclose our complete problem description and solution. However, if you need any more information, you can contact me.

Outcome: [Manuscript in Preparation]

Multi-modal and Multi-task Learning

Transfer Learning in Visual Question Answering

Design Optimization and Evolutionary Approaches

Optimized Resource Allocation in Mobile Cloud Computing

Reduce Overfitting Problem in Deep Learning Models

Optimized Distributed Clustering Model in DSN