This repository takes inspiration from the paper "User intention prediction for trigger-action programming rule using multi-view representation learning" (https://www.sciencedirect.com/science/article/abs/pii/S0957417424030653?via%3Dihub) and aims to build a representation model for user intention prediction in a trigger-action programming (TAP) setting.
The task is a typical multilabel classification problem.
Install the dependencies with:

```
pip install -r requirements.txt
```
The reference dataset is the IFTTT (If This Then That) dataset (https://zenodo.org/records/5572861, also available on Kaggle: https://www.kaggle.com/code/hrs2kr/analysis-on-ifttt-dataset). The main reference file is Step4_Single_Trigger_IoT_Rules.csv, which contains the target column "goal".
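As a quick orientation, here is a minimal sketch for loading the file and inspecting the target column; the local path is an assumption, only the file name and the "goal" column come from the dataset description:

```python
import pandas as pd

# Path is an assumption: adjust to wherever the Zenodo archive was extracted.
df = pd.read_csv("data/Step4_Single_Trigger_IoT_Rules.csv")

# "goal" is the target column: each rule is labeled with the user intention.
print(df.shape)
print(df["goal"].value_counts())
```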
The methodology proceeds step by step, oriented toward clarity and pragmatism:
- `01_dataset_analysis_and_preparation`: exploratory data analysis (EDA) and some preprocessing steps
- `02_text_embedding_creation_and_representation`: text embedding creation (sentence-transformers library) and UMAP representation (see the sketch after this list)
- `02_text_embedding_creation_and_representation_transformers`: text embedding creation (transformers library) and UMAP representation
- `02_text_embedding_creation_and_representation_finetuned`: text embedding creation (transformers library) and UMAP representation using a finetuned model
- `02_text_embedding_creation_and_representation_LLM`: text embedding creation (via ollama) and UMAP representation using a decoder-style model
- `03_dataset_manipulation_for_multilabel_classification`: produce a dataset suitable for a multilabel classification task
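A minimal sketch of the embedding + UMAP step, assuming the dataframe from the loading sketch above and that the rule text lives in the "name" column; the model name matches the one used in the results table, everything else is illustrative:

```python
from sentence_transformers import SentenceTransformer
import umap

# Encoder-style sentence embedding model (same family as in the results table).
model = SentenceTransformer("all-mpnet-base-v2")
embeddings = model.encode(df["name"].tolist(), show_progress_bar=True)

# Project to 2D for visual inspection of how well the goals cluster.
reducer = umap.UMAP(n_components=2, random_state=42)
coords = reducer.fit_transform(embeddings)
```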
Starting from a baseline model, the aim of the project is to build AI models of increasing complexity to achieve the best performance scores.
- Use only the textual representation of the rule (text embeddings as features) and add a custom classifier on top
  - `04_train_features_extractor`: train a classifier on top of the embedding representation (see the sketch after this list)
  - [enhanced text] add other dataset columns to the "name" column
  - `05_embedding_model_finetuning`: finetune the base embedding model with a contrastive loss (using the "goal" attribute, for example). Only the simplest approach, (anchor, positive) pairs with MultipleNegativesRankingLoss, is implemented and evaluated
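A minimal sketch of the feature-extractor approach, reusing `embeddings` from the previous sketch; the multilabel encoding (a hypothetical `goal_labels` column of label lists) and the split details are assumptions:

```python
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OneVsRestClassifier
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MultiLabelBinarizer
from sklearn.metrics import f1_score

# Binarize the multilabel targets (one indicator column per goal).
mlb = MultiLabelBinarizer()
y = mlb.fit_transform(df["goal_labels"])  # hypothetical column of label lists

X_train, X_test, y_train, y_test = train_test_split(
    embeddings, y, test_size=0.2, random_state=42
)

# Frozen embeddings as features, one logistic regression per label.
clf = OneVsRestClassifier(LogisticRegression(max_iter=1000))
clf.fit(X_train, y_train)

pred = clf.predict(X_test)
print("F1-micro:", f1_score(y_test, pred, average="micro"))
print("F1-macro:", f1_score(y_test, pred, average="macro"))
```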
- Model finetuning (see the sketch after this list)
  - [Only last layer]
  - [Some layers]
  - [PEFT method, e.g. LoRA]
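A minimal sketch of the (anchor, positive) finetuning with MultipleNegativesRankingLoss, using the classic sentence-transformers fit API; the pair construction (two rules sharing the same "goal") and the last-layer freezing are assumptions about the intended setup:

```python
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, InputExample, losses

model = SentenceTransformer("all-mpnet-base-v2")

# Optionally freeze everything except the last transformer layer
# (the "[Only last layer]" variant above).
for param in model[0].auto_model.parameters():
    param.requires_grad = False
for param in model[0].auto_model.encoder.layer[-1].parameters():
    param.requires_grad = True

# (anchor, positive) pairs: two rule texts sharing the same "goal".
train_examples = [
    InputExample(texts=[anchor, positive])
    for anchor, positive in same_goal_pairs  # hypothetical pair list
]
train_dataloader = DataLoader(train_examples, shuffle=True, batch_size=32)

# In-batch negatives: every other example in the batch acts as a negative.
train_loss = losses.MultipleNegativesRankingLoss(model)
model.fit(train_objectives=[(train_dataloader, train_loss)], epochs=1)
```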
- Decoder-style LLM classifier
  - `06_LLM_classificator`: use prompting techniques to test an LLM as a classifier, with the Llama 3.2 1-billion-parameter model (see the sketch after this list)
  - [Work with encoder-style and decoder-style representations] try to merge the two approaches
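A minimal sketch of the prompting approach via the ollama Python client; the prompt wording and the label list are illustrative assumptions, while `llama3.2:1b` matches the model size mentioned above:

```python
import ollama

GOALS = ["security", "energy saving", "comfort"]  # illustrative subset of goals

def classify_rule(rule_text: str) -> str:
    # Zero-shot prompt asking the model to pick goals from a fixed list.
    prompt = (
        "Given this trigger-action rule, answer with the matching goals "
        f"from {GOALS}, comma separated.\nRule: {rule_text}"
    )
    response = ollama.chat(
        model="llama3.2:1b",
        messages=[{"role": "user", "content": prompt}],
    )
    return response["message"]["content"]

print(classify_rule("If motion is detected at night, turn on the porch light"))
```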
- Graph Neural Network modeling
  - `07_kg_creation`: create the knowledge-graph (KG) representation of the dataset and save the generated graph file (see the sketch after this list)
  - [Create the graph representation and apply a GNN approach to it] extract graph-representation embeddings for the set of rules
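A minimal sketch of the KG construction with networkx; the node/edge schema and the "trigger"/"action" column names are assumptions (rules connected to their trigger, action, and goal):

```python
import networkx as nx

G = nx.Graph()

# Hypothetical schema: one node per rule, trigger, action, and goal.
for _, row in df.iterrows():
    rule = f"rule:{row['name']}"
    G.add_node(rule, kind="rule")
    for col, kind in [("trigger", "trigger"), ("action", "action"), ("goal", "goal")]:
        node = f"{kind}:{row[col]}"
        G.add_node(node, kind=kind)
        G.add_edge(rule, node)

# Persist the graph for the GNN step.
nx.write_graphml(G, "tap_rules_kg.graphml")
```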
- Multi-view representation learning
  - [Representation fusion] merge the different representations (see the sketch after this list)
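A minimal sketch of a simple late-fusion baseline, concatenating the text-view and graph-view embeddings before the classifier; both inputs are assumed to be aligned per rule:

```python
import numpy as np

# text_embeddings: (n_rules, d_text) from the sentence encoder.
# graph_embeddings: (n_rules, d_graph) from the GNN / node-embedding step.
fused = np.concatenate([text_embeddings, graph_embeddings], axis=1)

# The fused matrix can feed the same OneVsRest classifier as before.
```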
# | Approach | Model | Train Accuracy | Train F1-micro | Train F1-macro | Test Accuracy | Test F1-micro | Test F1-macro
---|---|---|---|---|---|---|---|---
1 | Features extractor | bert-base-uncased + Logistic Regression | 85.38% | 0.92 | 0.94 | 52.60% | 0.66 | 0.65
2 | Features extractor | all-mpnet-base-v2 + Logistic Regression | 73.59% | 0.83 | 0.81 | 57.81% | 0.71 | 0.69
3 | Features extractor | ModernBERT-base + Logistic Regression | 79.02% | 0.88 | 0.88 | 47.14% | 0.62 | 0.56
4 | Features extractor | all-mpnet-base-v2 finetuned + Logistic Regression | 71.76% | 0.82 | 0.67 | 65.89% | 0.77 | 0.64
5 | Features extractor | llama3.2 1 billion + Logistic Regression | 18.53% | 0.31 | 0.18 | 14.71% | 0.25 | 0.13
From the UMAP representation of the embeddings it is evident that the all-mpnet-base-v2 model groups textual representations of rules with the same goal more tightly. This better-clustered representation helps the downstream classifier, which therefore obtains better results on the test set.
The same logic applies to the all-mpnet-base-v2 finetuned model, which performs even better and is the best in its category so far. Both the UMAP representation and the metrics show that the embedding finetuning step with the simple MultipleNegativesRankingLoss improves the model's overall performance.
The results of the decoder-style embedding model llama3.2 1 billion are markedly worse than those of the encoder-style models. This is evident from the 2D UMAP representation of its embeddings, which appear clearly less clustered with respect to the classes to be discriminated.