domain-adaptation-nlp

Dataset

Our amazon dataset (Blitzer et al., 2007) can be downloaded here. Put this file in a folder called "data/amazon_reviews".

This data contains 2000 samples of the four categories in the amazon reviews data:

Books
Electronics
Home and Kitchen (Kitchen)
Movies and TV (DVDs)

We choose these categories because they are frequently used in nlp sentiment analysis domain adaptation papers.

You can open the data (for example the amazon data) using the following code, although this step should be already included in any function you need to run.

with open("../data/amazon_reviews/amazon_4.pickle", "rb") as fr:
        all_data = pickle.load(fr)

For each element in the amazon data, and for the movie data, the structure is as follows:

[0] bert embeddings ([CLS] layer)
[1] y labels (0 means negative and 1 means positive)
[2] domain name

Instructions to run

Balanced Conf Model and Few Labels Models

Create an output folder under this root directory if it does not exist.
Run src/sentiment_classification_amazon.py from the root directory.

Householder Transformation

Adjust the n of n_fold want to use(default: 1000).
Run src/domain_space_alignment.py from the root directory.

Name		Name	Last commit message	Last commit date
Latest commit History 206 Commits
notebooks		notebooks
outputs		outputs
resources		resources
src		src
.gitignore		.gitignore
README.md		README.md
literatures.md		literatures.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

domain-adaptation-nlp

Dataset

Instructions to run

Balanced Conf Model and Few Labels Models

Householder Transformation

About

Releases

Packages

Contributors 3

Languages

zycalice/domain-adaptation-nlp

Folders and files

Latest commit

History

Repository files navigation

domain-adaptation-nlp

Dataset

Instructions to run

Balanced Conf Model and Few Labels Models

Householder Transformation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages