Citation recommendation is the task of determining the most relevant citations for a given piece of text in a specific domain. Due to the large number of documents published continuously, it is not easy to find the relevant documents for a topic. In this project, you will design and implement a deep learning-based model for citation recommendation in the domain of scientific publications.
Some of the related papers in this task are listed below:
This project aims at carrying out performance evaluation of distributed streaming processing of applicaitons running on edge and cloud simultaneously. Methodology-wise students will first implement various use-cases on DSP simulators such as ECSNet++, at the second phase real world implementations using tools like Apache Storm will be required.
Automated Machine Learning (AutoML) consists of stages to automate the entire pipeline of machine learning. AutoML is especially useful for teams/organizations that does not necessarily have sufficient expertise on the AI/ML domain but expected to implement practical ML applications based on given data sources. There are already a variety frameworks for AutoML such as H2O, TPOT, Auto-SKLearn and AutoGluon.
In this project, you will work on cloud-native development and deployment of robotic applications. As the case study, we will use the Autoware repository (https://github.com/autowarefoundation/autoware) and apply cloud-native concepts and tools to build, test, and deploy Autoware in various containerization settings and granularity. You will get practical experience in using Docker, Ansible, and ROS2 tooling.
The first step in nearly all natural language processing (NLP) applications is applying preprocessing operations [1] to the text. Preprocessing operations include tokenization (segmenting the text into tokens), sentence splitting (dividing the text into sentences), normalization (converting the text into a canonical form), and the like. In this project, you will develop and implement algorithms for preprocessing of Turkish text using deep learning approaches. First, a literature review will be conducted and similar systems for English will be analyzed (e.g. UDPipe [2], Stanza [3]).
In this project, we aim at building a comprehensive word embedding [1] repository for the Turkish language. Using each of the state-of-the-art word embedding methods, embeddings of all the words in the language will be formed using a corpus. First, the three commonly-used embedding methods (Word2Vec [2,3], Glove [4], Fasttext [5]) will be used and an embedding dictionary for each one will be formed. Then we will continue with context-dependent embedding methods such as BERT [6] and Elmo [7].
In this project, you will design and implement a web service to publish senior project offers from faculty to students and disseminate the project results to a wider audience.
Roadmap:
- Requirements collection (talk with your fellow students and faculty)
- Survey of available technologies
- Designing the service according to requirements and available technologies
- Implementation and testing
Wearable devices can capture multimodal data corresponding to a person’s activity, stress, and sleep information to measure and improve health and well-being. Besides device measurement, there are also survey data as another information-gathering methodology. These can be related to gold standard questionnaires for sleep and st. Also, it may include some subjective assessments related to health satisfaction, overall health, happiness, diet, etc. In this project, the aim is to apply state-of-the-art deep learning techniques such as CNN, GNN and LSTM.