A researcher conducts a study on memory in a highly artificial laboratory setting using a very unusual task (e.g., memorising lists of non-existent words). While the study has high internal validity due to strict controls, what aspect of its generalisability is most likely to be low, and why?