Monday, September 16, 2024

Time Sequence Are Not That Completely different for LLMs | by Henry Lai | Jul, 2024

Must read


6. Bundling all these takeaways create a LTSM mannequin (LTSM-Bundle) that outperforms all present strategies that re-programming LLM for time collection and transformer based mostly time collection forecasting fashions.

Evaluating the bundle with present frameworks. Picture by creator.

Re-program a LTSM your self!

Wanna attempt to re-program your individual LTSM? Right here is the tutorial for the LTSM-bundle: https://github.com/daochenzha/ltsm/blob/principal/tutorial/README.md

Step 1: Create a digital setting. Clone and set up the necessities and the repository.

conda create -n ltsm python=3.8.0
conda activate ltsm
git clone git@github.com:daochenzha/ltsm.git
cd ltsm
pip3 set up -e .
pip3 set up -r necessities.txt

Step 2: Put together your dataset. Ensure that your native information folder like following:

- ltsm/
- datasets/
DATA_1.csv/
DATA_2.csv/
DATA_3.csv/
...

Step 3: Producing the time collection prompts from coaching, validating, and testing datasets

python3 prompt_generate_split.py

Step 4: Discover the generated time collection prompts within the ‘./prompt_data_split’ folder. Then run the next command for finalizing the prompts:

# normalizing the prompts
python3 prompt_normalization_split.py --mode match

#export the prompts to the "./prompt_data_normalize_split" folder
python3 prompt_normalization_split.py --mode rework

Ultimate Step: Practice your individual LTSM with Time Sequence Immediate and Linear Tokenization on gpt2-medium.

python3 main_ltsm.py 
--model LTSM
--model_name_or_path gpt2-medium
--train_epochs 500
--batch_size 10
--pred_len 96
--data_path "DATA_1.csv DATA_2.csv"
--test_data_path_list "DATA_3.csv"
--prompt_data_path "prompt_bank/prompt_data_normalize_split"
--freeze 0
--learning_rate 1e-3
--downsample_rate 20
--output_dir [Your_Output_Path]

Checkout extra particulars in our paper and GitHub Repo:

Paper: https://arxiv.org/pdf/2406.14045
Code: https://github.com/daochenzha/ltsm/

Reference:

[1] Liu, Pengfei, et al. “Pre-train, immediate, and predict: A scientific survey of prompting strategies in pure language processing.” ACM Computing Surveys 55.9 (2023): 1–35.

[2] Liu, Xiao, et al. “Self-supervised studying: Generative or contrastive.” IEEE transactions on data and information engineering 35.1 (2021): 857–876.

[3] Ansari, Abdul Fatir, et al. “Chronos: Studying the language of time collection.” arXiv preprint arXiv:2403.07815 (2024).



Supply hyperlink

More articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest article