<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="https://eric15342335.github.io/feed.xml" rel="self" type="application/atom+xml" /><link href="https://eric15342335.github.io/" rel="alternate" type="text/html" /><updated>2026-02-26T21:31:41+08:00</updated><id>https://eric15342335.github.io/feed.xml</id><title type="html">Eric15342335 \| HKU AppliedAI Year 3</title><subtitle>Webpage of eric15342335</subtitle><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><entry xml:lang="en"><title type="html">OpenMP on Windows stack overflow when using large stack</title><link href="https://eric15342335.github.io/blog/2026/02/26/openmp-on-windows-stack-overflow.html" rel="alternate" type="text/html" title="OpenMP on Windows stack overflow when using large stack" /><published>2026-02-26T20:58:30+08:00</published><updated>2026-02-26T20:58:30+08:00</updated><id>https://eric15342335.github.io/blog/2026/02/26/openmp-on-windows-stack-overflow</id><content type="html" xml:base="https://eric15342335.github.io/blog/2026/02/26/openmp-on-windows-stack-overflow.html"><![CDATA[<p>On Windows, an OpenMP program with large stack usage, compiled with mingw-w64’s default settings, will crash with a stack overflow error (0xc00000fd):</p>

<pre><code class="language-shell">123er@eric310  /d/Personal Data/Repositories/personal-repo/APAI4013/Assignment-1
$ gcc -fopenmp task2.c -static -g
</code></pre>

<pre><code class="language-shell">123er@eric310  /d/Personal Data/Repositories/personal-repo/APAI4013/Assignment-1
$ strace ./a
--- Process 6256 created
--- Process 6256 loaded C:\Windows\System32\ntdll.dll at 00007fffbfd20000
--- Process 6256 loaded C:\Windows\System32\kernel32.dll at 00007fffbe720000
--- Process 6256 loaded C:\Windows\System32\KernelBase.dll at 00007fffbc810000
--- Process 6256 loaded C:\Windows\System32\ucrtbase.dll at 00007fffbd970000
--- Process 6256 thread 5832 created
--- Process 6256, exception c00000fd at 00007ff761779226
--- Process 6256 thread 17792 exited with status 0xc00000fd
--- Process 6256 exited with status 0xc00000fd
</code></pre>

<p>Adding the <code>-Wl,--stack,10000000000</code> flag (the size is in bytes) to the GCC command resolves the issue by increasing the stack size reserved for the program:</p>

<pre><code class="language-shell">123er@eric310  /d/Personal Data/Repositories/personal-repo/APAI4013/Assignment-1
$ gcc -fopenmp task2.c -static -g -Wl,--stack,10000000000 
</code></pre>

<pre><code class="language-shell">123er@eric310  /d/Personal Data/Repositories/personal-repo/APAI4013/Assignment-1
$ strace ./a
--- Process 20596 created
--- Process 20596 loaded C:\Windows\System32\ntdll.dll at 00007fffbfd20000
--- Process 20596 loaded C:\Windows\System32\kernel32.dll at 00007fffbe720000
--- Process 20596 loaded C:\Windows\System32\KernelBase.dll at 00007fffbc810000
--- Process 20596 loaded C:\Windows\System32\ucrtbase.dll at 00007fffbd970000
--- Process 20596 thread 1092 created
--- Process 20596 thread 5492 created
// ... more threads created
--- Process 20596 loaded C:\Windows\System32\kernel.appcore.dll at 00007fffbb4c0000
--- Process 20596 loaded C:\Windows\System32\msvcrt.dll at 00007fffbf020000
Time taken: 0.898000 seconds
41
6334
15724
24464
9961
32391
18716
19912
17673
20037
 18467
 26500  19169
 11478  29358  26962
  5705  28145  23281  16827
   491   2995  11942   4827   5436
 14604   3902    153    292  12382  17421
 19718  19895   5447  21726  14771  11538   1869
 25667  26299  17035   9894  28703  23811  31322  30333
  4664  15141   7711  28253   6868  25547  27644  32662  32757
 12859   8723   9741  27529    778  12316   3035  22190   1842    288
757147
122502946
610374658
956230507
379002193
722484005
1299818353
-1370045995
-743873488
1826067517
--- Process 20596 thread 21820 exited with status 0x0
--- Process 20596 thread 15588 exited with status 0x0
// ... more threads exited
--- Process 20596 exited with status 0x0
</code></pre>

<p>The program:</p>

<pre><code class="language-c">#include &lt;stdio.h&gt;
#include &lt;omp.h&gt;
#include &lt;stdlib.h&gt;

#define n 16000
int main(void) {
    int L[n][n];
    int X[n];
    int Y[n];
    double start_time, end_time;

    omp_set_num_threads(16);

    start_time = omp_get_wtime();
    // init fixed seed
    srand(1);

    // init L, X, Y
    for (int i = 0; i &lt; n; i++) {
        X[i] = rand();
        for (int j = 0; j &lt;= i; j++) {
            L[i][j] = rand();
        }
        Y[i] = 0;
    }

    #pragma omp parallel for schedule(static)
    for (int i = 0; i &lt; n; i++) {
        for (int j = 0; j &lt;= i; j++) {
            Y[i] += L[i][j] * X[j];
        }
    }

    end_time = omp_get_wtime();
    printf("Time taken: %f seconds\n", end_time - start_time);

    // Visualize the result
    for (int i = 0; i &lt; 10; i++) {
        printf("%d\n", X[i]);
    }
    for (int i = 0; i &lt; 10; i++) {
        for (int j = 0; j &lt;= i; j++) {
            printf("%6d ", L[i][j]);
        }
        printf("\n");
    }
    for (int i = 0; i &lt; 10; i++) {
        printf("%d\n", Y[i]);
    }
    return 0;
}
</code></pre>
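<p>For context on why the default build overflows: the stack array <code>L</code> alone needs <code>n * n * sizeof(int)</code> bytes, which at <code>n = 16000</code> (assuming a 4-byte <code>int</code>, as on mingw-w64 x86_64) is about 1 GB, far beyond the few megabytes that Windows executables reserve for the main thread’s stack by default. A quick back-of-the-envelope check (Python used here purely for the arithmetic):</p>

<pre><code class="language-python">n = 16000
int_size = 4  # assumed size of a C int in bytes

l_bytes = n * n * int_size   # L[n][n]
xy_bytes = 2 * n * int_size  # X[n] and Y[n]
total = l_bytes + xy_bytes

print(l_bytes)  # 1024000000
print(total)    # 1024128000, i.e. just over 1 GB of stack needed
</code></pre>

<p>This also shows that a reserve of exactly 1000000000 bytes would still be slightly too small, which is why the command above passes a larger value to <code>-Wl,--stack</code>.</p>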

<h2 id="appendix">Appendix</h2>

<p>Compiler version:</p>

<pre><code class="language-shell">$ gcc -v
Using built-in specs.
COLLECT_GCC=D:\Personal Data\Repositories\mingw64\bin\gcc.exe
COLLECT_LTO_WRAPPER=D:/Personal\ Data/Repositories/mingw64/bin/../libexec/gcc/x86_64-w64-mingw32/15.2.0/lto-wrapper.exe
OFFLOAD_TARGET_NAMES=nvptx-none
Target: x86_64-w64-mingw32
Configured with: ../configure --prefix=/R/winlibs_staging_ucrt64/inst_gcc-15.2.0/share/gcc --build=x86_64-w64-mingw32 --host=x86_64-w64-mingw32 --enable-offload-targets=nvptx-none --with-pkgversion='MinGW-W64 x86_64-ucrt-posix-seh, built by Brecht Sanders, r5' --with-tune=generic --enable-checking=release --enable-threads=posix --disable-sjlj-exceptions --disable-libunwind-exceptions --disable-serial-configure --disable-bootstrap --enable-host-shared --enable-plugin --disable-default-ssp --disable-rpath --disable-libstdcxx-debug --disable-version-specific-runtime-libs --disable-symvers --enable-languages=c,c++,fortran,lto,objc,obj-c++ --disable-gold --disable-nls --disable-stage1-checking --disable-win32-registry --disable-multilib --enable-ld --enable-libquadmath --enable-libada --enable-libssp --enable-libstdcxx --enable-lto --enable-fully-dynamic-string --enable-libgomp --enable-graphite --enable-mingw-wildcard --enable-libstdcxx-time --enable-libstdcxx-pch --with-mpc=/c/Prog/winlibs_staging_ucrt/custombuilt64 --with-mpfr=/c/Prog/winlibs_staging_ucrt/custombuilt64 --with-gmp=/c/Prog/winlibs_staging_ucrt/custombuilt64 --with-isl=/c/Prog/winlibs_staging_ucrt/custombuilt64 --disable-libstdcxx-backtrace --enable-install-libiberty --enable-__cxa_atexit --without-included-gettext --with-diagnostics-color=auto --enable-clocale=generic --enable-libgdiagnostics --with-libiconv --with-system-zlib --with-build-sysroot=/R/winlibs_staging_ucrt64/gcc-15.2.0/build_mingw/mingw-w64 CFLAGS='-I/c/Prog/winlibs_staging_ucrt/custombuilt64/include/libdl-win32   -march=nocona -msahf -mtune=generic -O2 -Wno-error=format' CXXFLAGS='-Wno-int-conversion  -march=nocona -msahf -mtune=generic -O2' LDFLAGS='-pthread -Wl,--no-insert-timestamp -Wl,--dynamicbase -Wl,--high-entropy-va -Wl,--nxcompat -Wl,--tsaware' LD=/c/Prog/winlibs_staging_ucrt/custombuilt64/share/binutils/bin/ld.exe
Thread model: posix
Supported LTO compression algorithms: zlib zstd
gcc version 15.2.0 (MinGW-W64 x86_64-ucrt-posix-seh, built by Brecht Sanders, r5)
</code></pre>

<pre><code class="language-shell">$ gcc --version
gcc.exe (MinGW-W64 x86_64-ucrt-posix-seh, built by Brecht Sanders, r5) 15.2.0
Copyright (C) 2025 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
</code></pre>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[On Windows, an OpenMP program with large stack usage, compiled with mingw-w64’s default settings, will crash with a stack overflow error (0xc00000fd):]]></summary></entry><entry xml:lang="en"><title type="html">Future Data Leakage in (subset of) MIMIC-IV Readmission EHR Dataset</title><link href="https://eric15342335.github.io/blog/2025/12/08/Future-Data-Leakage.html" rel="alternate" type="text/html" title="Future Data Leakage in (subset of) MIMIC-IV Readmission EHR Dataset" /><published>2025-12-08T03:00:00+08:00</published><updated>2025-12-08T03:00:00+08:00</updated><id>https://eric15342335.github.io/blog/2025/12/08/Future-Data-Leakage</id><content type="html" xml:base="https://eric15342335.github.io/blog/2025/12/08/Future-Data-Leakage.html"><![CDATA[<p>It has been a long time since I last wrote a blog post.</p>

<p>Note: This is a cross-post from my Kaggle competition discussion. It is private right now but the professor/TA could make it public if they want to.</p>

<p>Anyway, let’s talk about our main topic: Future Data Leakage.</p>

<p>In Electronic Health Records (abbr: EHR) datasets, we (usually) have a variable number of rows per patient admission, and one label (corresponding to that admission) as our base prediction unit. Unfortunately (and logically), some admissions are correlated, as they could come from the same patient (obviously), or coincide with a public incident (disasters), a pandemic outbreak, annual events, etc.</p>

<p>We are going to talk about the <em>Readmission Prediction</em> task. Given a list (a time series within one admission) of patient vitals (e.g. blood pressure, heart rate, laboratory test results), medications administered, patient demographics (age, ethnicity, gender), etc., our job is to predict whether the patient will be readmitted within 30 days after hospital discharge (i.e. come back within 30 days after leaving the hospital).</p>

<p>There was one particular flaw (or, a feature?) in the subset of the dataset we received (the professor gave it to us in <a href="https://webapp.science.hku.hk/sr4/servlet/enquiry?Type=Course&amp;course_code=STAT3612">HKU STAT3612</a>): if a patient appears only once in the dataset, we can successfully infer that they did not return to the hospital after their only recorded admission in our database.</p>

<p>This approach has two problems:</p>

<ol>
  <li>It won’t work for incomplete patient databases, e.g. when we don’t have all of a patient’s admissions, or when the patient moved to another hospital after their first admission.</li>
  <li>The model becomes biased when predicting patients with only one admission in the dataset (e.g. it predicts <code>False</code> for most of them). While this might work well on this particular dataset, it won’t generalize to real-life data, especially if real-life patient admissions are diverse enough. For example, during a pandemic, many patients might have only one admission (the first time they got infected) but come back later (e.g. due to complications). In this case, a model focusing on patient vitals and demographics would be more robust (in terms of distribution shift) than a model relying on admission counts.</li>
</ol>

<p>But apparently, with simple logic like this, we were able to boost our AUROC (Area Under the Receiver Operating Characteristic curve) score from 0.5 (which means the model is essentially guessing randomly) to 0.617:</p>

<pre><code class="language-python">import pandas as pd
from sklearn.metrics import roc_auc_score

label = "readmitted_within_30days"

train = pd.read_csv("train.csv").drop_duplicates(subset="id")
valid = pd.read_csv("valid.csv").drop_duplicates(subset="id")

train_visit_counts = train["subject_id"].value_counts()
valid_visit_counts = valid["subject_id"].value_counts()

def predict(row, visit_counts):
    patient_only_visit_once = visit_counts.get(row["subject_id"], 1) == 1
    if patient_only_visit_once:
        return 0
    return 1

train["prediction"] = train.apply(lambda row: predict(row, train_visit_counts), axis=1)
valid["prediction"] = valid.apply(lambda row: predict(row, valid_visit_counts), axis=1)

print(f"Train AUROC: {roc_auc_score(train[label], train['prediction']):.4f}")
print(f"Valid AUROC: {roc_auc_score(valid[label], valid['prediction']):.4f}")
</code></pre>

<p>Which is pretty amazing. We were able to obtain an AUROC of <code>0.95</code> on the public leaderboard (the 50% split, ~1000 test rows) and secure first place (on the public LB). The private leaderboard is another story.</p>
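<p>As an aside on the metric: when a hard 0/1 prediction is passed to <code>roc_auc_score</code> as the score, the resulting AUROC reduces to balanced accuracy, (TPR + TNR) / 2, which is why a simple count-based rule lands at a single fixed value like 0.617. A pure-Python sanity check on made-up toy labels (no relation to the actual dataset):</p>

<pre><code class="language-python"># Toy data: 1 = readmitted. The "prediction" mimics a hard 0/1 rule.
y_true = [1, 0, 0, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 0, 0, 1, 0, 0]

tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
tpr = tp / sum(y_true)                  # true positive rate = 2/3
tnr = tn / (len(y_true) - sum(y_true))  # true negative rate = 4/5
auroc = (tpr + tnr) / 2                 # equals roc_auc_score for binary scores
print(auroc)                            # 0.7333...
</code></pre>

<p>Any rule whose two rates average even slightly above chance will therefore sit above 0.5.</p>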

<p>Anyway, the professor banned this technique, which is totally reasonable given that this is severe data leakage (it could harm your model’s performance when fed real-life data). But I think it is quite interesting, and I would like to share it here.</p>

<p>A sample codebase implementing a complete machine learning pipeline on this dataset can be found here: <a href="https://github.com/eric15342335/STAT3612-MIMIC-IV-Readmission-Prediction-Demo">https://github.com/eric15342335/STAT3612-MIMIC-IV-Readmission-Prediction-Demo</a>. Note that this is just a demo codebase which was created separately after the competition.</p>

<p>Good luck to everyone!</p>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[It has been a long time since I last wrote a blog post.]]></summary></entry><entry xml:lang="en"><title type="html">How to enable RTX Virtual Super Resolution (VSR) or HDR in VLC</title><link href="https://eric15342335.github.io/blog/2025/05/17/vlc-rtx-enable.html" rel="alternate" type="text/html" title="How to enable RTX Virtual Super Resolution (VSR) or HDR in VLC" /><published>2025-05-17T20:00:00+08:00</published><updated>2025-05-17T20:00:00+08:00</updated><id>https://eric15342335.github.io/blog/2025/05/17/vlc-rtx-enable</id><content type="html" xml:base="https://eric15342335.github.io/blog/2025/05/17/vlc-rtx-enable.html"><![CDATA[<p>If you are coming here because you have a relatively new desktop/laptop, with a recent GPU like the 30 or 40 series (e.g. RTX 4050 Laptop), and you want to improve the video quality when you watch videos using VLC, then yes I did exactly the same thing. Here is how.</p>

<p>The first Google search probably brings you to <a href="https://downloads.videolan.org/testing/vlc-rtx-upscaler/">here, the official VLC download page with a test build of version 3.0.19</a>. In my experience, that build sometimes crashes, and in some scenarios I cannot reproduce, Virtual Super Resolution (VSR) and the AI-powered SDR-to-HDR mapping cannot be used at the same time. Therefore, I suggest you use the latest version of VLC (3.0.21 at the time of writing) and manually enable the settings (it is very quick), so that you can enjoy a stable viewing experience. (Note that this version also has some bugs, e.g. the fullscreen resolution is not really upscaled, leaving black bars around the video, but the RTX features work.)</p>

<ol>
  <li>Go to Tools -&gt; Preferences -&gt; Video -&gt; Show Settings (All).</li>
  <li>Search for “3D11” (which is Direct3D 11) and open Output Modules -&gt; Direct3D 11.</li>
  <li>Change “Video Upscaling Mode” to “Super Resolution”.</li>
  <li>Change “HDR Output Mode” to “Generate HDR from SDR”.</li>
  <li>(You usually don’t need to) Go back to “Output Module” and change “Automatic” to “Direct3D 11”.</li>
</ol>

<p>Don’t forget to:</p>

<ol>
  <li>Set the “vlc.exe” to “High Performance GPU (NVIDIA RTX …)” in your Windows settings.</li>
  <li>Enable VSR and HDR features in the NVIDIA App or NVIDIA Control Panel.</li>
  <li>Install the latest drivers, run Windows Update, and follow the usual general advice.</li>
</ol>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[If you are coming here because you have a relatively new desktop/laptop, with a recent GPU like the 30 or 40 series (e.g. RTX 4050 Laptop), and you want to improve the video quality when you watch videos using VLC, then yes I did exactly the same thing. Here is how.]]></summary></entry><entry xml:lang="en"><title type="html">AI is Going to Replace Most Work</title><link href="https://eric15342335.github.io/blog/2025/04/11/Some-thoughts-about-AI.html" rel="alternate" type="text/html" title="AI is Going to Replace Most Work" /><published>2025-04-11T00:42:00+08:00</published><updated>2025-04-11T00:42:00+08:00</updated><id>https://eric15342335.github.io/blog/2025/04/11/Some-thoughts-about-AI</id><content type="html" xml:base="https://eric15342335.github.io/blog/2025/04/11/Some-thoughts-about-AI.html"><![CDATA[<h4 id="clickbait-not-really">Clickbait (not really)</h4>

<h2 id="background">Background</h2>

<p>Here are just my thoughts on AI (in particular <strong>Large Language Models</strong>, since it is currently the most impactful AI system for normal people). I do write them in point-form, so don’t expect coherence here :(</p>

<h2 id="requirements-for-ai-replacement">Requirements for AI Replacement</h2>

<ol>
  <li>A lot of people did the same/similar work previously</li>
  <li>Examples were readily available online</li>
</ol>

<blockquote>
  <p><strong>Statement</strong>: No matter how powerful LLMs are (reasoning, agents, long context retrieval, etc.), they still require previously seen knowledge and cannot solely rely on their “hallucination”.</p>

  <p><strong>Hypothesis</strong>: AI with significant reasoning ability cannot complete a task flawlessly if it is completely new to either humans or AI systems in general.</p>
</blockquote>

<h2 id="result">Result</h2>

<p>AI is able to train on and remember these examples, letting anyone do the same thing with minimal prior knowledge.</p>

<h2 id="solution">Solution</h2>

<p>Go to fields that have minimal examples:</p>

<ul>
  <li><strong>Fields with few practitioners</strong>
    <ul>
      <li>Relatively unused fields like certain historical niches
        <ul>
          <li>(Though I likely won’t study these fields)</li>
        </ul>
      </li>
      <li>Useful fields that require significant knowledge and research (e.g., <em>SOTA</em>)</li>
    </ul>
  </li>
  <li><strong>Fields with no patterns</strong>
    <ul>
      <li>Fields that are purely random in nature
        <ul>
          <li>(I likely won’t succeed in these fields due to their indeterministic nature)</li>
        </ul>
      </li>
    </ul>
  </li>
  <li><strong>Fields that constantly generate new examples</strong>
    <ul>
      <li>Faster than what AI can learn</li>
      <li>Question: <em>As AI scales up every year, what fields can really keep pace?</em></li>
    </ul>
  </li>
</ul>

<hr />

<h2 id="what-ai-have-i-been-using-recently">What AI have I been using recently?</h2>

<h3 id="chatgpt-4o-on-chatgptcom"><a href="https://chatgpt.com">ChatGPT 4o</a> (on chatgpt.com)</h3>

<ul>
  <li><strong>Image generation ability</strong> (as of Apr 11, 2025)
    <ul>
      <li>Native generation is really impressive and is able to form correct, clean text such as banners or blackboards, etc. However, there are still some issues with the geometric understanding.</li>
    </ul>
  </li>
</ul>

<h3 id="gemini-25-pro-preview-on-aistudiogooglecom"><a href="https://aistudio.google.com">Gemini 2.5 Pro Preview</a> (on aistudio.google.com)</h3>

<ul>
  <li><strong>Really impressive long context retrieval and reasoning ability</strong>
    <ul>
      <li>It is able to perform calculations <em>step by step</em> (not skipping steps, unlike other AI models when they receive 10 math questions and are told to do them all at once). It is able to make the best informed decision according to the context provided. TL;DR, it is <em>attentive to details</em> (Yes K.P. Wat!)</li>
    </ul>
  </li>
  <li>Unlimited usage and free, can’t demand more, right?</li>
</ul>

<h3 id="gemini-deepresearch-on-geminigooglecom"><a href="https://gemini.google.com">Gemini DeepResearch</a> (on gemini.google.com)</h3>

<ul>
  <li>This is really impressive. I used it for several purposes:
    <ul>
      <li>Finding scholarly published papers and using them as my APA citations for my essay assignments and other stuff</li>
      <li>Performing in-depth broad range analysis and investigation on one particular field or topic
        <ul>
          <li>It is able to consider almost all aspects of the topic (primarily due to the massive amount of information it receives from the internet, and its reasoning ability) and generate a detailed report. If my prompt is detailed, then I would expect a report of more than 10 pages and 5000+ words (including references).</li>
        </ul>
      </li>
      <li>I almost built a workflow for this:
        <ol>
          <li>(This should be step 0) Come up with an interesting topic that I want to read about and spend time on it</li>
          <li>Ask <a href="https://aistudio.google.com">Gemini 2.5 Pro</a> or other AI models to understand what I don’t know, and to provide me with a detailed prompt covering the various aspects of the topic to be explored</li>
          <li>Throw the prompt into <a href="https://gemini.google.com">Gemini DeepResearch</a></li>
          <li>In the meantime, generate a $\LaTeX{}$ template using either <a href="https://aistudio.google.com">Gemini 2.5 Pro</a> or <a href="https://claude.ai">Claude 3.7 Sonnet</a> (you may ask why I use this? It is because I bought Claude Pro, so not using it feels like wasting money 😅)</li>
          <li>Wait until the <a href="https://gemini.google.com">Gemini DeepResearch</a> finishes</li>
          <li>Throw the entire report into <a href="https://chatgpt.com">ChatGPT 4o</a> and let it generate a cover page image (using its native image generation ability)</li>
          <li>Ask any of the LLM models mentioned above to amend the $\LaTeX{}$ template in order to adjust the section, subsection and other stuff that the DeepResearch-generated report gave us</li>
          <li>Copy respective sections from the report and paste them into the $\LaTeX{}$ template</li>
          <li>Fix the errors that arise; this step should take around 5-10 trials depending on luck</li>
          <li>$\cdots$</li>
          <li>Done!</li>
        </ol>
      </li>
      <li>In fact, you can see one of my <a href="/assets/A24_Consultancy_Research_AIGenReport.pdf">previously generated report samples here</a></li>
    </ul>
  </li>
</ul>

<h3 id="claude-37-sonnet-thinking-on-claudeai-via-my-claude-pro-subscription"><a href="https://claude.ai">Claude 3.7 Sonnet</a> (thinking) (on claude.ai via my Claude Pro subscription)</h3>

<ul>
  <li>Before the <a href="https://chatgpt.com">GPT-4o</a> image generation ability was released, I thought this was the best decision (of subscribing to an AI service), because Claude was the strongest LLM model at programming (web development specifically) and I had a web development course (<a href="https://www.cs.hku.hk/index.php/programmes/course-offered?infile=2024/comp3322.html">COMP3322</a>)</li>
  <li>It turns out <a href="https://chatgpt.com">GPT-4o</a> native image output was released a few days later</li>
  <li><a href="https://aistudio.google.com">Gemini 2.5 Pro</a> followed next
    <ul>
      <li>So it was kind of not a good time to buy a subscription lol</li>
    </ul>
  </li>
  <li>UI is generally good, with support of React artifacts (I primarily use it for data visualization since I don’t need to run it on my python lol)</li>
  <li>In terms of the <em>attention to details</em> ability, it is not as good as <a href="https://aistudio.google.com">Gemini 2.5 Pro</a> (which is able to perform step by step calculation). Claude 3.7 Sonnet (thinking) would just summarize everything if I didn’t explicitly ask it to explore every step in-depth</li>
</ul>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[Clickbait (not really)]]></summary></entry><entry xml:lang="en"><title type="html">chatgpt.hku.hk does not render LaTeX: how to fix</title><link href="https://eric15342335.github.io/blog/2025/04/01/chatgpt-hku-lacking-mathjax.html" rel="alternate" type="text/html" title="chatgpt.hku.hk does not render LaTeX: how to fix" /><published>2025-04-01T23:45:00+08:00</published><updated>2025-04-01T23:45:00+08:00</updated><id>https://eric15342335.github.io/blog/2025/04/01/chatgpt-hku-lacking-mathjax</id><content type="html" xml:base="https://eric15342335.github.io/blog/2025/04/01/chatgpt-hku-lacking-mathjax.html"><![CDATA[<h2 id="update">Update</h2>

<p>I’ve emailed HKU ITS and they shipped this feature (rendering $\LaTeX$ in chatgpt.hku.hk) within a week. Now the issue is rather whether the LLM model adheres to outputting math symbols in the correct $\LaTeX$ format, e.g. <code>$</code> or <code>$$</code>, etc.</p>
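<p>For reference, a model reply that follows those delimiter conventions looks like this (a minimal made-up example, not from an actual chat):</p>

<pre><code class="language-latex">The roots are $x = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a}$, obtained by solving

$$
ax^2 + bx + c = 0, \qquad a \neq 0.
$$
</code></pre>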

<h2 id="background">Background</h2>

<p>If you’re studying at HKU, you would know that the University provides access to <a href="https://deepseek.com/">DeepSeek-r1</a> at <a href="https://chatgpt.hku.hk">chatgpt.hku.hk</a> (with no restrictions or rate limits, unlike previous models such as <a href="https://openai.com/index/gpt-4o-system-card/">GPT-4o</a>, which was limited to 500,000 tokens monthly). However, as of when this blog was written, the web UI does not render $\LaTeX{}$, which is a major drawback for students studying mathematics or related subjects.</p>

<h2 id="solution">Solution</h2>

<p>Follow these steps:</p>

<ol>
  <li>
    <p>Include the following in your prompt:</p>

    <pre><code class="language-txt"> Enclose LaTeX in $ (inline math) or $$ (multi-line math). Use appropriate delimiters for your equations.
</code></pre>
  </li>
  <li>
    <p>Simply paste this code into your browser console:</p>

    <p class="encircled-text">Update: I made a Chrome extension to automate this process. <a href="https://github.com/eric15342335/mathjax-injector-extension">Check it out here!</a></p>

    <pre><code class="language-javascript"> // Create and load MathJax
 var script = document.createElement("script");
 script.src = "https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-mml-chtml.js";

 // Configure MathJax with all needed delimiters
 window.MathJax = {
     tex: {
         inlineMath: [["$", "$"], ["\\(", "\\)"]],
         displayMath: [["$$", "$$"], ["\\[", "\\]"]],
         packages: ['base', 'ams', 'noerrors', 'noundefined', 'autoload', 'require']
     },
     startup: {
         typeset: false
     }
 };

 // Add script to document
 document.head.appendChild(script);

 // Set up observer to detect new messages
 script.onload = function () {
     // Initial typeset
     MathJax.typeset();

     // Debounce function to limit typeset calls
     let timeoutId = null;
     const debounceTypeset = function () {
         if (timeoutId) {
             clearTimeout(timeoutId);
         }
         timeoutId = setTimeout(function () {
             MathJax.typeset();
             timeoutId = null;
         }, 10); // Wait 10 ms before processing to batch changes
     };

     // Create observer for new content
     new MutationObserver(function (mutations) {
         let hasNewNodes = false;

         // Check if any of the mutations contain new nodes
         for (let i = 0; i &lt; mutations.length; i++) {
             if (mutations[i].addedNodes.length &gt; 0) {
                 hasNewNodes = true;
                 break;
             }
         }

         if (hasNewNodes) {
             // Use debounced typeset
             debounceTypeset();
         }
     }).observe(document.body, {
         childList: true,
         subtree: true,
         characterData: false,
     });
 };
</code></pre>
  </li>
</ol>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[Update]]></summary></entry><entry xml:lang="en"><title type="html">Breaking VPL SQL Assignment</title><link href="https://eric15342335.github.io/blog/2025/03/18/Breaking-VPL-SQL-assignment.html" rel="alternate" type="text/html" title="Breaking VPL SQL Assignment" /><published>2025-03-18T21:10:00+08:00</published><updated>2025-03-18T21:10:00+08:00</updated><id>https://eric15342335.github.io/blog/2025/03/18/Breaking-VPL-SQL-assignment</id><content type="html" xml:base="https://eric15342335.github.io/blog/2025/03/18/Breaking-VPL-SQL-assignment.html"><![CDATA[<p><img src="/assets/images/2025-03-18-Breaking-VPL-SQL-assignment-demo-success.webp" alt="A quick demo photo of what is the main focus of this post" /></p>

<p>How did I find it?</p>

<h2 id="background">Background</h2>

<p>We got a question:</p>

<p><strong>Q6. [12%] Display <em>station_ID</em>, <em>station_name</em>, and <em>rental_income</em> for each of the charging stations in February 2024.</strong></p>

<ul>
  <li>Sort the results by <em>rental_income</em> in descending order, then <em>station_ID</em> in ascending order.</li>
  <li><em>rental_income</em> is the sum of the <em>payment_amount</em> in a given period.</li>
</ul>

<p>Normally, we’d answer this question using a (My)SQL statement:</p>

<pre><code class="language-sql">SELECT cs.station_ID, cs.name AS station_name,
    SUM(rt.payment_amount) AS rental_income
FROM ChargingStation cs
    INNER JOIN RentalTransaction rt ON cs.station_ID = rt.station_ID
WHERE rt.end_datetime &gt;= '2024-02-01 00:00:00' AND
    rt.end_datetime &lt; '2024-03-01 00:00:00' AND
    rt.status = 1 -- Completed transaction
GROUP BY cs.station_ID, cs.name
ORDER BY rental_income DESC, cs.station_ID ASC;
</code></pre>

<p>But someone on Moodle pointed out that the database is actually SQLite (in case you’re wondering, he is <a href="https://skylee.xyz">skylee</a>). That sparked my curiosity and got me exploring ways to “complete” this question through alternate means.</p>

<p><img src="/assets/images/2025-03-18-Breaking-VPL-SQL-assignment-sqlite-discovered.webp" alt="The output of `.version`: which shows the SQLite database version installed on the VPL jail system." class="_75_percent_width" /></p>

<h2 id="information-gathering">Information Gathering</h2>

<p>By Googling, I learned that we can execute shell commands from within the SQLite command-line shell. First, I listed the files in the current directory:</p>

<pre><code class="language-shell">.shell ls
</code></pre>

<p><img src="/assets/images/2025-03-18-Breaking-VPL-SQL-assignment-list-current-dir.webp" alt="Result of running `.shell ls` in SQLite" class="_75_percent_width" /></p>

<p>Obviously, we need to inspect each file to see what we’ve got:</p>

<pre><code class="language-shell">.shell cat *
</code></pre>

<p><img src="/assets/images/2025-03-18-Breaking-VPL-SQL-assignment-vpl-stuff.webp" alt="Result of running `.shell cat *` which prints the contents of all files in the current directory. On the screen, there’s one shell script used by VPL for judging purposes." class="_75_percent_width" /></p>

<p>Hmm. Most of the files aren’t related to the assignment or the test cases themselves. Let’s check the <code>*.txt</code> and <code>*.sql</code> files since I strongly suspect they’re tied to this particular question:</p>

<pre><code class="language-shell">.shell cat *.txt *.sql
</code></pre>

<p><img src="/assets/images/2025-03-18-Breaking-VPL-SQL-assignment-not-vpl-stuff.webp" alt="Result of running `.shell cat *.txt *.sql` which attempts to print out all files related to the above-mentioned SQL question." class="_75_percent_width" /></p>

<p>We can see that the text file is likely what we want: the desired output. So, simply printing it should be enough to score 100%.</p>

<p>Note that this probably won’t work against hidden test cases, where they might use a different text file name to differentiate between tests. But that’s it.</p>

<h2 id="fluff">Fluff</h2>

<p>If you want to make VSCode (and GitHub Copilot) work like Cursor Tab, make sure to enable this experimental setting (switch to a <code>pre-release</code> version of the extension first!):</p>

<p><img src="/assets/images/2025-03-18-Breaking-VPL-SQL-assignment-make-vscode-like-cursor.webp" alt="VSCode Copilot Tab setting location" class="_50_percent_width" /></p>

<p><em>I recommend trying out other options too! I can’t say for sure, but some might surprise you!</em></p>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[]]></summary></entry><entry xml:lang="en"><title type="html">Talking about Emotional Crashes</title><link href="https://eric15342335.github.io/blog/2025/03/18/Talking-about-Emotional-Crashes.html" rel="alternate" type="text/html" title="Talking about Emotional Crashes" /><published>2025-03-18T20:22:00+08:00</published><updated>2025-03-18T20:22:00+08:00</updated><id>https://eric15342335.github.io/blog/2025/03/18/Talking-about-Emotional-Crashes</id><content type="html" xml:base="https://eric15342335.github.io/blog/2025/03/18/Talking-about-Emotional-Crashes.html"><![CDATA[<blockquote>
  <p>Quick note: I’m definitely not a psychologist. If you’re dealing with serious mental health issues or feel like talking to someone professionally, please reach out to a local clinical psychologist near you.</p>
</blockquote>

<h2 id="preliminary">Preliminary</h2>

<p>When it comes to emotional crashes, some of us might have experienced them before, while others maybe haven’t yet. Some people might have had one without even realising it, while others could be worrying they’re about to have one anytime soon. Emotional crashes are a hot topic - especially for us university students - mainly because they can hit us pretty hard. I think it’s important that we talk openly about what an emotional crash is, how to spot one coming, and how to handle it. I hope this blog post helps you understand emotional crashes better.</p>

<h2 id="what-is-an-emotional-crash">What is an Emotional Crash</h2>

<p>Usually, when people haven’t been productive or haven’t studied for several days (or even a week or two), they might call this an “emotional crash.” Personally, I think those situations happen mostly because we don’t stick to a regular schedule or because the stuff around us got pretty distracting. I’m not saying that doesn’t count as crashing, but when I say “emotional crash,” I’m talking about that sudden moment when a huge wave of negative feelings hits you out of nowhere. It makes you feel totally uncomfortable, makes you want to stop everything you’re doing right at that moment, and makes it almost impossible to be productive.</p>

<h2 id="when-will-you-crash">When Will You Crash</h2>

<p>Honestly speaking, we can’t really predict exactly when we’ll crash. But sometimes having a long, non-stop lecture (like those 4-hour ones…) might push you closer to a meltdown. Also, not getting enough sleep definitely doesn’t help either.</p>

<h2 id="am-i-really-crashing">Am I Really Crashing?</h2>

<p>Crashes are pretty subjective; different people feel them differently. But usually, when you’re crashing, you’ll focus more on short-term distractions rather than things that reward you later on. You might just keep scrolling social media (doomscroll alert!) or avoid all productive tasks. It’s sort of a cycle - doomscrolling can cause crashes, and crashes can lead to doomscrolling.</p>

<h2 id="some-important-facts-about-emotional-crashes">Some Important Facts About Emotional Crashes</h2>

<ol>
  <li>Sudden surge of emotions, difficult (or impossible) to handle.</li>
  <li>Becoming a “walking zombie.”</li>
</ol>

<h2 id="what-you-really-need-to-do-when-there-is-a-crash">What You Really Need to Do When There Is a Crash</h2>

<ol>
  <li>First thing to do: Take a deep breath, step away from your desk and walk around. From my own experience, I find that walking around HKU (from CYM Canteen all the way to Centennial Campus, a couple times at least) is great for clearing your mind, improving your mood, and reflecting a bit on life.</li>
  <li>Don’t immediately run away from your challenges or tasks - instead, use that walk to slow down your mind a bit and gain some perspective.</li>
  <li>Bargain with yourself. If you really need a break, ask yourself: would maybe a short 30-minute break be good enough? It’s likely better than spending the next four hours on Netflix or Instagram.</li>
</ol>

<h2 id="what-should-you-do-when-others-crash">What Should You Do When Others Crash</h2>

<p>We’ve probably all seen friends cancel plans or suddenly ditch important meetings and events. That kind of behaviour is pretty normal for people experiencing an emotional slump or “crash.” You shouldn’t feel good about it (although honestly, if you’re an introvert like me, cancelled plans can feel like a secret relief - except if it’s a study meeting!). But more importantly, when friends tell you they’re having a tough time, it usually means they really trust you enough to open up (seriously, you should feel honored).</p>

<h3 id="dont">Don’t</h3>

<ol>
  <li>Ignore the message.</li>
  <li>Reply with just “Okay” and leave it at that.</li>
  <li>Send something generic like “Hope you’ll feel better soon!” (I mean, it’s harmless, but it’s kind of too cliché to actually help.)</li>
</ol>

<h3 id="dos">Do’s</h3>

<ol>
  <li>If you sort of know what’s been going on in their life lately, that’s a great start! Connect your reply with what they’ve shared recently (maybe they’re stressed about exams, revision, or group projects?). This helps you better understand why they’ve messaged you about it.</li>
  <li>Try to really get a sense of what they actually need. Okay, I know this is pretty hard. One tip is to pay attention to what events/tasks they’re cancelling. If it’s something important and you’re involved, you could say something simple but helpful like, “Whenever you feel ready, just hit me up and I’ll be around!” Trust me, that’ll boost your emotional IQ points.</li>
</ol>

<h2 id="how-to-sense-others-having-a-crash">How to Sense Others Having a Crash</h2>

<p>Actually, I think sensing when someone’s crashing is easier if you already know that person pretty well. What’s harder, though, is paying enough attention to pick up on these signals - basically stopping yourself from casually brushing off their problems and instead genuinely thinking about what they might need at that moment.</p>

<h2 id="final-notes">Final Notes</h2>

<p>Life isn’t easy. It’s complicated and messy, and honestly, we’re probably never walking exactly the same path as anyone else in our lives - be it class schedules, skills, interests, or personal goals. It’s always worth remembering: Everyone’s unique in their own complicated way.</p>

<p>I really hope you found something helpful here!</p>

<h2 id="recent-updates">Recent Updates</h2>

<p>Midterms are coming real soon, and it’s pretty important for me right now to keep my study/sleep schedule steady.<br />
Anyway, some random life updates about me:</p>

<ul>
  <li>Got an offer from RTHK for a summer internship - feeling relieved and sort-of-secured now for my Year 2 summer break, even though the internship is just about 2 months.</li>
  <li>Recently I’ve noticed a huge flood of information coming in from classes. I’m kinda worried there’s NO WAY I’ll perfectly remember all lecture note details, especially since they might be asked in exams.</li>
  <li>Something odd I’ve noticed is that certain subjects stick better in my memory (like database/SQL - I find it easy to write MySQL queries). But other classes like STAT3600 Linear Statistical Analysis (this one’s quite serious), APAI3010 Computer Vision (mildly forgetful), and web development (same here, mild) seem to disappear from my memory very quickly, like within just a few days or a week. Probably because I haven’t practiced those key concepts enough?</li>
  <li>Oh, and seriously: taking six courses per semester? I don’t really recommend it… It’s a HUGE mental pressure test. Maybe it’s doable if you have already mastered like half the content for at least three courses beforehand. (Quick disclaimer: Not everyone is the same! Different people handle academic stress differently, totally depends on your learning style and stamina.)</li>
</ul>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[Quick note: I’m definitely not a psychologist. If you’re dealing with serious mental health issues or feel like talking to someone professionally, please reach out to a local clinical psychologist near you.]]></summary></entry><entry xml:lang="en"><title type="html">Prompt for learning statistics</title><link href="https://eric15342335.github.io/blog/2025/02/08/Prompt-for-learning-statistics.html" rel="alternate" type="text/html" title="Prompt for learning statistics" /><published>2025-02-08T19:00:00+08:00</published><updated>2025-02-08T19:00:00+08:00</updated><id>https://eric15342335.github.io/blog/2025/02/08/Prompt-for-learning-statistics</id><content type="html" xml:base="https://eric15342335.github.io/blog/2025/02/08/Prompt-for-learning-statistics.html"><![CDATA[<h2 id="prompt">Prompt</h2>

<pre><code class="language-markdown">Please adopt a conversational style similar to a helpful and patient tutor specializing in mathematics and statistics, particularly for a university student studying Applied AI.  When responding to my questions, please try to incorporate the following elements:

*   **Patient and Step-by-Step Explanations:** Break down complex topics into smaller, manageable parts.  Assume I may not have a strong foundation in math and stats and need concepts explained gradually.
*   **Use Analogies and Intuitive Examples:**  Whenever possible, use real-world examples and analogies to make abstract mathematical and statistical concepts more concrete and easier to understand.  Feel free to use analogies like cooking, physics, or everyday scenarios.
*   **Address Hesitations and Doubts Directly:** Acknowledge when I express confusion, frustration, or feeling overwhelmed.  Validate these feelings and offer encouragement. Don't just gloss over points of difficulty.
*   **Balance Rigor with Accessibility:** Aim for mathematical accuracy and rigor, but prioritize clear and accessible explanations over overly technical jargon, especially at first. Gradually introduce more formal terms as understanding builds.
*   **Provide Encouragement and Positive Reinforcement:** Acknowledge good questions, insightful observations, and progress in understanding. Build confidence.
*   **Regularly Check for Understanding:** Ask questions like "Does this make sense?" "Is this clearer now?" "Let me know if you want to explore this further" to ensure I'm following along and to encourage me to ask more questions.
*   **Structure and Organize Responses Clearly:** Use headings, bullet points, numbered lists, and clear formatting to make information easier to read and digest.
*   **Provide Direct and Honest Answers:** Be direct in responses, even if it means acknowledging limitations or complexities.  Don't oversimplify to the point of inaccuracy.
*   **Be Iterative and Responsive:** Pay close attention to my questions and feedback, and adapt explanations accordingly.  Be prepared to revisit concepts from different angles if needed.
*   **Focus on "Why" and "Intuition" in Addition to "What" and "How":** Explain the underlying motivation, purpose, and intuition behind formulas and concepts, not just the mechanical steps or definitions.  Help me understand the "big picture."

Essentially, please act as a tutor who is knowledgeable, patient, encouraging, and focused on helping me build a deep and intuitive understanding of challenging mathematical and statistical topics relevant to my Applied AI studies.  I may ask very detailed and probing questions, and I appreciate thorough and thoughtful responses in this style.
</code></pre>

<h2 id="some-updates-about-google-ai-studio-in-general">Some updates about Google AI Studio in general</h2>

<p>It seems that the <a href="/blog/2025/01/27/How-to-fix-google-aistudio-latex-formatting.html">LaTeX formatting issue mentioned here</a> has been fixed, based on my trials. I have also discovered a potential bug with the ‘Code Execution’ feature, in which plots can fail to display if <code>plt.show()</code> is executed before a <code>print()</code> statement:</p>

<pre><code class="language-python">import matplotlib as plt
# Some code here
plt.show()
print(a_variable_storing_calculation_results)
</code></pre>

<p>The workaround would be to tell the AI to “Print stuff before showing the plot”.</p>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[Prompt]]></summary></entry><entry xml:lang="en"><title type="html">Purpose of learning Mathematics and Statistics</title><link href="https://eric15342335.github.io/blog/2025/02/07/Purpose-of-learning-Maths-and-Stats.html" rel="alternate" type="text/html" title="Purpose of learning Mathematics and Statistics" /><published>2025-02-07T15:40:00+08:00</published><updated>2025-02-07T15:40:00+08:00</updated><id>https://eric15342335.github.io/blog/2025/02/07/Purpose-of-learning-Maths-and-Stats</id><content type="html" xml:base="https://eric15342335.github.io/blog/2025/02/07/Purpose-of-learning-Maths-and-Stats.html"><![CDATA[<h2 id="self-reflection">Self reflection</h2>

<p>I began my Computer Science journey pretty early – almost at the start of my secondary education. Hence, I was able to grasp a solid understanding of what CS is trying to do and what its ultimate goals are. However, the same does not hold for mathematics and statistics: I can describe my learning in these two subjects as “stuffing a duck” – I digest knowledge without knowing why we need to study it. By “why,” I don’t mean the shallow application of each formula or rule I learned, but the ultimate purpose of what mathematics, as a subject, wants to achieve (and the same holds for statistics). That’s why, after three semesters in university, I feel increasingly bored by mathematics and statistics subjects and even question why I need to study them. I don’t want any answers involving real-life applications, e.g., “to calculate areas for home renovation” or “to predict future income”; what I want is something philosophical and not influenced by society – not sticking to the “values” society wants us to have. Why did we create mathematics in the first place?</p>

<p>That’s why you’re seeing this post. It is a summary of what I discussed with AI (particularly, Google Gemini 2.0 Flash Thinking Experimental from <a href="https://aistudio.google.com">https://aistudio.google.com</a>), and I want to share it with everyone who feels lost about the meaning of their life as they enter university.</p>

<p>This post is inspired by <a href="https://thisisxxz.com/">ThisIsXXZ</a> posts:</p>

<ul>
  <li><a href="https://thisisxxz.com/2024/01/15/learning-review-1/">我的第一次反向传播</a>
    <ul>
      <li>He talked about the transition in study methods after graduating from secondary school and entering university, and I pretty much agree with him.</li>
      <li>Side note: He mentioned a new way of taking notes that I found pretty inspirational.</li>
    </ul>
  </li>
  <li><a href="https://thisisxxz.com/2024/01/19/ANTH2350/">Meanings of Life</a>
    <ul>
      <li>Literally, the meaning of life. I have fallen many times for the “meaning of life” that society fabricates – and it’s not really what I want.</li>
    </ul>
  </li>
</ul>

<h2 id="mathematics-a-top-down-view-the-grand-system-chart">Mathematics: A Top-Down View (The Grand System Chart)</h2>

<p>Imagine mathematics as a vast, incredibly powerful, and deeply interconnected intellectual system.  Our goal is to understand its overall architecture and purpose.</p>

<h3 id="level-1-the-ultimate-purpose-of-mathematics-the-why-of-maths-existence">Level 1: The Ultimate Purpose of Mathematics (The “Why” of Math’s Existence)</h3>

<ul>
  <li><strong>Understanding and Modeling Reality (Both Abstract and Concrete):</strong> At the very highest level, mathematics is fundamentally about <strong>understanding the underlying structures and patterns of reality</strong>, both in the <em>physical world</em> and in the <em>realm of abstract ideas</em>. It’s about building precise, logical models to describe, explain, and predict phenomena.
    <ul>
      <li><strong>Think of it as:</strong>  Creating a universal language and toolkit for describing and reasoning about <em>anything</em> that can be quantified, structured, or logically analyzed.</li>
      <li><strong>Analogy:</strong> Like architecture, mathematics is about designing the <em>framework</em> upon which other disciplines (science, engineering, even parts of philosophy and economics) are built.</li>
    </ul>
  </li>
</ul>

<h3 id="level-2-major-branches-of-mathematical-activity-the-2nd-top---different-ways-math-is-done">Level 2: Major Branches of Mathematical Activity (The “2nd Top” - Different Ways Math is Done)</h3>

<p>To achieve this ultimate purpose, mathematical activity can be broadly categorized into these interconnected branches:</p>

<ul>
  <li><strong>Pure Mathematics (The Exploratory and Foundation-Building Branch):</strong>
    <ul>
      <li><strong>Focus:</strong> Exploring abstract mathematical structures, patterns, and relationships <em>for their own sake</em>, driven by curiosity, internal logic, and aesthetic principles.  This is where the “self-highing” feeling sometimes comes from, but it’s actually about deep exploration of the mathematical universe.</li>
      <li><strong>Motivation:</strong>  Intrinsic beauty, intellectual challenge, uncovering fundamental truths within mathematics itself.  Often, pure math discoveries later find unexpected applications (sometimes decades or centuries later!).</li>
      <li><strong>Examples:</strong> Number Theory, Abstract Algebra, Topology, Real and Complex Analysis, Set Theory, Logic.</li>
      <li><strong>Analogy:</strong> Like fundamental research in physics – exploring the deepest laws of nature, even without immediate practical applications, but these discoveries often become the basis for future technologies.</li>
    </ul>
  </li>
  <li><strong>Applied Mathematics (The Problem-Solving and Connection-Making Branch):</strong>
    <ul>
      <li><strong>Focus:</strong>  Using mathematical tools and techniques developed in pure math to solve <em>real-world problems</em> in science, engineering, technology, finance, social sciences, and many other fields.  This is where math becomes directly useful and impactful.</li>
      <li><strong>Motivation:</strong>  Practical problem-solving, creating new technologies, improving efficiency, understanding and predicting real-world phenomena.</li>
      <li><strong>Examples:</strong>  Mathematical Modeling, Numerical Analysis, Optimization, Statistics and Probability, Differential Equations, Operations Research, Financial Mathematics, Cryptography.</li>
      <li><strong>Analogy:</strong> Like engineering – taking the fundamental principles of physics and applying them to design bridges, airplanes, computers, etc.</li>
    </ul>
  </li>
  <li><strong>Mathematical Foundations (The Rigorous and Logical Bedrock):</strong>
    <ul>
      <li><strong>Focus:</strong>  Establishing the logical foundations, rigor, and consistency of mathematics itself.  Ensuring that mathematical reasoning is sound and free from contradictions.</li>
      <li><strong>Motivation:</strong>  Certainty, precision, avoiding logical fallacies, understanding the nature of mathematical truth.</li>
      <li><strong>Examples:</strong>  Mathematical Logic, Set Theory, Proof Theory, Category Theory.</li>
      <li><strong>Analogy:</strong> Like the foundations of a building – unseen but essential for the stability and integrity of the entire structure.</li>
    </ul>
  </li>
</ul>

<h3 id="level-3-core-mathematical-areas-within-each-branch-specific-fields-of-study">Level 3: Core Mathematical Areas within Each Branch (Specific Fields of Study)</h3>

<p>Within each of these branches, we have specific areas of mathematical study (this is where you find Calculus, Linear Algebra, Optimization, etc.):</p>

<ul>
  <li><strong>From Pure Math:</strong>
    <ul>
      <li><strong>Analysis (Calculus and its extensions):</strong>  Study of continuous change, limits, derivatives, integrals, functions – fundamental for modeling continuous phenomena.  <em>Multivariable Calculus fits here.</em></li>
      <li><strong>Algebra (Abstract Structures):</strong>  Study of groups, rings, fields, vector spaces, matrices – provides powerful tools for abstraction and structure. <em>Linear Algebra fits here.</em></li>
      <li><strong>Geometry and Topology:</strong>  Study of shapes, spaces, and their properties, both in familiar and abstract settings.</li>
      <li><strong>Number Theory:</strong>  Study of integers and their properties, surprisingly deep and applicable to cryptography.</li>
    </ul>
  </li>
  <li><strong>From Applied Math:</strong>
    <ul>
      <li><strong>Statistics and Probability:</strong>  Dealing with uncertainty, data analysis, inference, prediction – essential for data-driven fields like AI.</li>
      <li><strong>Numerical Analysis:</strong>  Developing algorithms for approximating solutions to mathematical problems, especially those that can’t be solved exactly (crucial for computer simulations and calculations).</li>
      <li><strong>Optimization:</strong>  Finding the best solutions within constraints – central to machine learning, operations research, engineering design. <em>Introduction to Optimization fits here.</em></li>
      <li><strong>Differential Equations:</strong>  Modeling rates of change and dynamic systems – used everywhere in science and engineering.</li>
    </ul>
  </li>
  <li><strong>From Mathematical Foundations:</strong>
    <ul>
      <li><strong>Mathematical Logic:</strong>  Formalizing reasoning and proof.</li>
      <li><strong>Set Theory:</strong>  The language for describing collections and relationships.</li>
    </ul>
  </li>
</ul>

<h3 id="level-4--specific-mathematical-tools-and-techniques-the-nuts-and-bolts">Level 4:  Specific Mathematical Tools and Techniques (The “Nuts and Bolts”)</h3>

<p>Within each core area, you learn specific techniques, theorems, formulas, and algorithms.  For example:</p>

<ul>
  <li><strong>Calculus:</strong> Derivatives, integrals, limits, series, partial derivatives, multiple integrals.</li>
  <li><strong>Linear Algebra:</strong> Vectors, matrices, matrix operations, determinants, eigenvalues, eigenvectors, linear transformations.</li>
  <li><strong>Optimization:</strong> Gradient descent, linear programming, convex optimization, dynamic programming.</li>
  <li><strong>Statistics:</strong> Hypothesis testing, regression, probability distributions, confidence intervals, machine learning algorithms.</li>
</ul>
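<p>As a tiny, concrete illustration of one of these tools – gradient descent from the optimization list – here is a minimal Python sketch (the function being minimized and all names are invented purely for illustration):</p>

<pre><code class="language-python">def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Minimize a function by repeatedly stepping against its gradient."""
    x = x0
    for _ in range(steps):
        x = x - lr * grad(x)
    return x

# Minimize f(x) = (x - 3)^2; its gradient is 2 * (x - 3), so the minimum is at x = 3.
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
</code></pre>

<p>The same “follow the slope downhill” idea is exactly what multivariable calculus generalizes to many dimensions, and it sits at the heart of how machine-learning models are trained.</p>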

<p><strong>How Multivariable Calculus, Linear Algebra, and Optimization Fit In:</strong></p>

<ul>
  <li><strong>Multivariable Calculus:</strong>  Provides the tools to analyze and model systems with multiple interacting factors, essential for understanding complex real-world phenomena and for optimization in higher dimensions.</li>
  <li><strong>Linear Algebra:</strong>  Provides the language and methods for working with data in a structured way (vectors, matrices), for transformations, and for solving systems of equations – fundamental for data analysis, machine learning, computer graphics, and many other areas.</li>
  <li><strong>Introduction to Optimization:</strong>  Gives you the fundamental techniques for finding “best” solutions, which is at the heart of machine learning, AI, operations research, and engineering design.</li>
</ul>

<p><strong>The Interconnectedness is Key:</strong></p>

<p>It’s crucial to see that these branches and areas of math are <em>not</em> isolated. They are deeply interconnected and build upon each other.  Linear algebra is used extensively in calculus, statistics, and optimization.  Calculus is used in probability, statistics, optimization, and differential equations.  And so on.</p>

<p><strong>Think of it like a vast toolkit:</strong> Pure math develops the tools, applied math uses them to solve problems, and mathematical foundations ensure the toolkit is reliable and consistent.  You are learning to use these tools!</p>

<p><strong>How to Make Math Feel Less Abstract and More Meaningful:</strong></p>

<ul>
  <li><strong>Focus on the “Why” and “What For”:</strong>  Always ask: <em>Why</em> is this concept important? <em>What</em> problems does it help solve?</li>
  <li><strong>Seek Applications:</strong>  Actively look for real-world applications and examples for every mathematical concept you learn, especially in your field of Applied AI.</li>
  <li><strong>See the Interconnections:</strong>  Try to see how different mathematical areas relate to each other.</li>
  <li><strong>Practice Problem Solving:</strong>  Working through problems, especially applied problems, helps solidify your understanding and see the utility of the math.</li>
  <li><strong>Be Patient and Persistent:</strong>  Mathematical understanding often comes gradually.  It’s okay to feel confused or frustrated at times. Keep practicing, keep asking questions, and gradually the pieces will start to fit together.</li>
</ul>

<p>By adopting this “top-down” perspective, you can start to see mathematics not as a collection of random symbols and abstract concepts, but as a vast, powerful, and deeply interconnected system for understanding and shaping the world.  Your current courses (Multivariable Calculus, Linear Algebra, Optimization) are essential parts of this system, equipping you with fundamental tools for your future in Applied AI and beyond.  Keep exploring – the deeper you go, the more you’ll appreciate the beauty and power of mathematics!</p>

<p>You’ve hit on a really important point!  It’s completely valid that you can visualize the “top-down” structure of CS, but statistics feels much more nebulous.  And you’re right, my previous explanations about STAT3600 being “linear regression and useful” are helpful for <em>motivation</em> but don’t give you that “big picture” structural understanding you crave.</p>

<p>Let’s build a “top-down diagram” or mental map for statistics, explicitly addressing your feeling of it being continuous or discrete and helping you see the overall structure.</p>

<h2 id="statistics-a-top-down-mental-map-like-a-system-design-chart-for-stats">Statistics: A Top-Down Mental Map (Like a System Design Chart for Stats!)</h2>

<p>Think of statistics as a system for <strong>understanding and learning from data</strong>. Just like a software system has different layers and components, statistics has different branches and tools that work together.</p>

<h3 id="level-1-the-overarching-goal-the-top---what-is-statistics-really-for">Level 1: The Overarching Goal (The “Top” - What is Statistics REALLY for?)</h3>

<ul>
  <li><strong>Understanding the World Through Data:</strong>  At its highest level, statistics is about using data to answer questions about the world, make informed decisions, and gain insights.  This is the ultimate “why.”
    <ul>
      <li><strong>Examples:</strong>
        <ul>
          <li>Understanding customer behavior to improve business strategies.</li>
          <li>Analyzing medical data to find effective treatments.</li>
          <li>Predicting climate change impacts.</li>
          <li>Building AI models that learn from data.</li>
        </ul>
      </li>
    </ul>
  </li>
</ul>

<h3 id="level-2-major-branches-the-2nd-top---broad-categories-of-statistical-work">Level 2: Major Branches (The “2nd Top” - Broad Categories of Statistical Work)</h3>

<p>To achieve the overarching goal, statistics is broadly divided into these major areas:</p>

<ul>
  <li><strong>Descriptive Statistics:</strong>  <em>Describing and summarizing</em> data.  This is about getting a handle on the data you have.
    <ul>
      <li><strong>Tools:</strong>  Mean, median, mode, standard deviation, variance, histograms, boxplots, summary tables.</li>
      <li><strong>Purpose:</strong>  To get a clear picture of the data’s main features: central tendency, spread, distribution shape.  Think of it as the “data exploration” phase.</li>
    </ul>
  </li>
  <li><strong>Inferential Statistics:</strong>  <em>Drawing conclusions and making generalizations</em> about a <em>larger population</em> based on a <em>sample</em> of data. This is where you go beyond just describing what you see in your sample.
    <ul>
      <li><strong>Key Idea:</strong>  Dealing with uncertainty and making probabilistic statements.</li>
      <li><strong>Sub-Branches (Examples):</strong>
        <ul>
          <li><strong>Estimation:</strong>  Estimating population parameters (like the average height of <em>all</em> students based on a sample).  Point estimates, confidence intervals.</li>
          <li><strong>Hypothesis Testing:</strong>  Testing claims or hypotheses about populations (like “Does this new drug actually work better than the old one?”).  P-values, significance levels.</li>
          <li><strong>Regression Analysis:</strong>  <em>Modeling relationships</em> between variables to understand how changes in one variable affect another and to make predictions.  This is where STAT3600’s linear regression fits in!
            <ul>
              <li><strong>Linear Regression (Simple, Multiple):</strong>  Modeling <em>linear</em> relationships.</li>
              <li><strong>Generalized Linear Models:</strong>  Extending regression to handle different types of outcome variables (binary, counts, etc.).</li>
            </ul>
          </li>
          <li><strong>Analysis of Variance (ANOVA):</strong>  Comparing <em>means of groups</em> to see if there are significant differences.  This is also in STAT3600.  One-way, two-way ANOVA.</li>
          <li><strong>Non-parametric Statistics:</strong>  Methods that don’t rely on assumptions about the data’s distribution (useful when data isn’t normally distributed).</li>
        </ul>
      </li>
    </ul>
  </li>
  <li><strong>Predictive Statistics/Machine Learning (Overlapping, but Distinct Focus):</strong>  <em>Building models to make accurate predictions</em> on new data.  While traditional statistics focuses on inference and understanding relationships, predictive statistics is more about prediction accuracy.  AI heavily overlaps here.
    <ul>
      <li><strong>Techniques:</strong>  Many machine learning algorithms fall under this (decision trees, support vector machines, neural networks, etc.).  Some statistical regression and classification methods are also used for prediction.</li>
    </ul>
  </li>
</ul>
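<p>To make “regression analysis” a little more concrete, here is a minimal ordinary-least-squares sketch in pure Python (the data points are invented for illustration):</p>

<pre><code class="language-python">def fit_simple_linear_regression(xs, ys):
    """Ordinary least squares for the line y = a + b*x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope: covariance of x and y divided by the variance of x
    b = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    a = mean_y - b * mean_x
    return a, b

# Points lying exactly on y = 1 + 2x, so the fit recovers a = 1, b = 2
a, b = fit_simple_linear_regression([1, 2, 3, 4], [3, 5, 7, 9])
</code></pre>

<p>Courses like STAT3600 build on exactly this calculation, adding the probabilistic machinery around it: assumptions about the errors, standard errors for the coefficients, and hypothesis tests on whether each coefficient is really non-zero.</p>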

<h3 id="level-3-fundamental-statistical-concepts-the-foundation-underpinning-everything">Level 3: Fundamental Statistical Concepts (The Foundation Underpinning Everything)</h3>

<p>These are the core building blocks that you need to understand to work in any of the branches above:</p>

<ul>
  <li><strong>Probability Theory:</strong>  The language of uncertainty.  Understanding probability is essential for all of inferential statistics.</li>
  <li><strong>Random Variables and Distributions:</strong>  Modeling random phenomena.  Probability distributions (Normal, Binomial, Poisson, etc.) are the workhorses of statistics.  PDFs, CDFs, MGFs are tools to describe distributions.</li>
  <li><strong>Sampling Distributions:</strong>  Understanding how statistics (like sample means) vary from sample to sample.  Crucial for inference.</li>
  <li><strong>Estimation Theory:</strong>  How to estimate population parameters from sample data.</li>
  <li><strong>Hypothesis Testing Framework:</strong>  The logic and process of testing hypotheses.</li>
  <li><strong>Linear Algebra and Calculus:</strong>  Mathematical tools that are heavily used in many statistical methods, especially regression and more advanced techniques.</li>
</ul>
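<p>The "sampling distributions" idea above is easy to see by simulation: draw many samples from the same population and watch how the sample mean itself varies. The population parameters here (mean 100, SD 15, samples of 25) are arbitrary choices for the sketch:</p>

<pre><code class="language-python"># Sampling distribution of the mean, by simulation (stdlib only).
import random
from statistics import mean, stdev

random.seed(0)
population_mean, population_sd, n = 100, 15, 25

# Draw 2000 samples of size n; record each sample's mean
sample_means = [
    mean(random.gauss(population_mean, population_sd) for _ in range(n))
    for _ in range(2000)
]

# The sample means cluster around the true mean, with spread close to
# the theoretical standard error sigma / sqrt(n) = 15 / 5 = 3.
print(round(mean(sample_means), 1))   # near 100
print(round(stdev(sample_means), 1))  # near 3
</code></pre>

<p>This is exactly why inference works: even though any single sample mean is random, its distribution across samples is predictable.</p>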

<p><strong>Addressing Your “Continuous vs. Discrete” Feeling:</strong></p>

<ul>
  <li><strong>“Continuous” Feeling (Need Previous Knowledge):</strong> You’re right, statistics <em>is</em> somewhat cumulative.  You often need to understand probability and distributions before diving into regression.  Think of it like layers in your CS system diagram.  Probability and distributions are like the “operating system” of statistics – they are fundamental and underpin many higher-level techniques.
    <ul>
      <li><strong>How to Navigate:</strong>  When learning a new concept like linear regression, <em>briefly review</em> the foundational concepts it relies on (probability, distributions, etc.). Don’t feel you need to become an expert in everything before moving on, but make sure you have a basic grasp of the prerequisites.</li>
    </ul>
  </li>
  <li><strong>“Discrete” Feeling (Unconnected Concepts):</strong> It <em>can</em> feel discrete if you’re just learning formulas and procedures in isolation.  That’s why the “top-down” approach is so important!
    <ul>
      <li><strong>How to Connect:</strong>  Actively look for connections between concepts.  Ask:
        <ul>
          <li><strong>How does this concept relate to descriptive statistics?</strong> (e.g., Regression helps explain variance, which is a descriptive statistic).</li>
          <li><strong>How does this concept relate to hypothesis testing?</strong> (e.g., Regression coefficients are often tested for significance using hypothesis tests).</li>
          <li><strong>What is the <em>purpose</em> of this concept in the bigger picture of inference or prediction?</strong></li>
        </ul>
      </li>
    </ul>
  </li>
</ul>
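<p>The "regression coefficients are tested with hypothesis tests" connection above can be shown in a few lines. This is a hand-rolled sketch on toy data (the x/y values are invented); real work would use a statistics library, but the point is that the slope estimate and its t-statistic come from the same machinery as any other hypothesis test:</p>

<pre><code class="language-python"># Simple linear regression, plus the t-statistic for H0: slope = 0.
from math import sqrt
from statistics import mean

x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2.1, 2.9, 4.2, 4.8, 6.1, 6.9, 8.2, 8.8]

xbar, ybar = mean(x), mean(y)
sxx = sum((xi - xbar) ** 2 for xi in x)
slope = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y)) / sxx
intercept = ybar - slope * xbar

# Residual variance feeds the standard error of the slope estimate
residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
s2 = sum(r ** 2 for r in residuals) / (len(x) - 2)
se_slope = sqrt(s2 / sxx)

# Compare t against a t-distribution with n - 2 degrees of freedom
t_stat = slope / se_slope
print(round(slope, 3), round(t_stat, 1))
</code></pre>

<p>So a regression output's "p-value on the slope" is just the hypothesis-testing framework from Level 3 applied to an estimated coefficient.</p>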

<p>By actively building this “top-down” mental map and focusing on connections and purpose, you’ll start to see statistics less as a collection of disconnected formulas and more as a coherent and powerful system for understanding the world through data. You’ve got this!</p>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[Self reflection]]></summary></entry><entry xml:lang="en"><title type="html">How to fix Google AI Studio LaTeX Formatting</title><link href="https://eric15342335.github.io/blog/2025/01/27/How-to-fix-google-aistudio-latex-formatting.html" rel="alternate" type="text/html" title="How to fix Google AI Studio LaTeX Formatting" /><published>2025-01-27T17:20:00+08:00</published><updated>2025-01-27T17:20:00+08:00</updated><id>https://eric15342335.github.io/blog/2025/01/27/How-to-fix-google-aistudio-latex-formatting</id><content type="html" xml:base="https://eric15342335.github.io/blog/2025/01/27/How-to-fix-google-aistudio-latex-formatting.html"><![CDATA[<h2 id="update">Update</h2>

<p>As of April 11, 2025, the models from Google AI Studio now output coherent LaTeX formatting without specific guided prompts. This post now serves as a historical reference.</p>

<h2 id="solution">Solution</h2>

<p>Paste this in your system prompt (or after your user prompt):</p>

<pre><code class="language-txt">Enclose LaTeX in $ and don't use double/triple ticks.
</code></pre>

<h2 id="source">Source</h2>

<p><a href="https://www.reddit.com/r/Bard/comments/1h2ndnu/comment/m1q5gea/">https://www.reddit.com/r/Bard/comments/1h2ndnu/comment/m1q5gea/</a></p>]]></content><author><name>Cheng Ho Ming, Eric</name><email>eric310@connect.hku.hk</email></author><category term="blog" /><summary type="html"><![CDATA[Quick fix for Google AI Studio's broken LaTeX formatting: add 'Enclose LaTeX in $ and don't use double/triple ticks' to your prompt for properly rendered math equations and formulas.]]></summary></entry></feed>