1
I'm an IB diploma candidate (in HS), and writing a research paper is an important part of that curriculum. I am hoping to write my paper on how an LLM's training data impacts its output, comparing one trained on, say, Wikipedia as opposed to Reddit.
I have access to some reasonably powerful Nvidia GPUs and plenty of time to train.
I'm fairly decent at "technology," as wide of an umbrella as that is -- I use Linux, have messed with things like koboldcpp, etc. -- but my programming abilities are weak; all I've done is 6.00.1x (intro to python) through edX.
Does this seem like a reasonable project? I know the results will be bad, but will they be enough to measure differences in some way?