Hi all,

I'm working with a data set in which I have a limited number of distinct "x" values but a continuous range of "y" values corresponding to each "x" value. Specifically, I am working with a set of seven different field sites, each of which has a different width. I sampled multiple plots at each site, giving me a list of 12 sample diversities taken from each plot. I'd like to see if there's a correlation between the diversity at each site and the width of the site.

I ran a Pearson's correlation test, cor.test(buf, rnat, method = "pearson", use="complete.obs"), and got a significant result back, but then realized that the data describing my site widths (my "x" values) are not normally distributed, and normality is an assumption of Pearson's test.
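
In case it helps, here is a minimal sketch of how the data are laid out and the kind of call I ran. All numbers are made up for illustration, including the number of plots per site; only the names buf and rnat come from my script, and for this sketch I'm assuming buf holds the site width and rnat the diversity. I've left out use= because the toy data have no missing values.

```r
# Toy data only: made-up numbers, NOT my real measurements.
site_width <- c(5, 10, 15, 20, 30, 60, 100)    # hypothetical widths, one per site
plots_per_site <- 3                            # hypothetical number of plots

# Each plot inherits the width of its site, so x has only 7 distinct values.
buf <- rep(site_width, each = plots_per_site)

# Made-up diversity values, one per plot.
set.seed(1)
rnat <- round(8 + 0.05 * buf + rnorm(length(buf), sd = 2), 1)

length(buf)          # number of (x, y) pairs
length(unique(buf))  # number of distinct x values

# Pearson's test on the paired vectors.
res <- cor.test(buf, rnat, method = "pearson")
res$estimate
res$p.value
```

So every diversity value is paired with the width of the site it came from, and the same width is repeated once for each plot at that site.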

My question is: is there a way to estimate the correlation between my limited set of x values and my larger set of y values without assuming a normal distribution? I know Spearman's rank correlation can be used with non-normal data, but it requires me to rank the data, which doesn't seem to be a viable method given there are so many more "y" values than "x" values.
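
For reference, this is the Spearman version I was considering, again on made-up toy data with the same assumed meaning for buf and rnat. With only seven distinct widths, every x value is tied several times over, which is what makes me doubt the ranking step. I set exact = FALSE because R cannot compute an exact p-value when there are ties.

```r
# Toy data only: made-up numbers, NOT my real measurements.
site_width <- c(5, 10, 15, 20, 30, 60, 100)    # hypothetical widths, one per site
buf <- rep(site_width, each = 3)               # hypothetical 3 plots per site

set.seed(1)
rnat <- round(8 + 0.05 * buf + rnorm(length(buf), sd = 2), 1)

# rank() assigns tied values their average rank by default,
# so the x ranks take only as many distinct values as there are sites.
table(rank(buf))

# Spearman's test; exact = FALSE asks for the asymptotic p-value,
# since an exact one is not available with ties.
res_s <- cor.test(buf, rnat, method = "spearman", exact = FALSE)
res_s$estimate
res_s$p.value
```

The call runs, but I'm unsure whether a rank-based result is trustworthy when the x side has this many ties.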

Thank you