Trouble Replicating Results from Original Paper #1

Open
niuniueiko opened this issue Jul 24, 2024 · 1 comment
Labels
question Further information is requested

Comments


niuniueiko commented Jul 24, 2024

Hi Andrea,

First of all, thank you for developing this amazing tool for CINSARC analysis! I am using this package to test CINSARC as a tool for sarcoma prognosis, and therefore I want to validate its efficacy by replicating the results from the original paper (Chibon et al., 2010). However, there is no algorithm or code provided that details how the nearest centroid method was applied. Thus, your package has become a lifesaver, especially for someone with a background in biology.

I preprocessed the data by normalizing it with the GCRMA method, as stated in the original paper. I then annotated the Affymetrix probe sets and summarized their expression to the gene level (using limma::avereps()). I supplied the arguments required by your package (a normalized expression matrix and a vector containing the metastasis status of each sample) and successfully ran the hack_cinsarc() function. However, the resulting CINSARC prognosis does not agree with the published result (only 58% similarity), which I took from the phenodata in the GEO expression matrix. I am wondering at which step I might have gone wrong. Is a separate training and validation step required?
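
For reference, a minimal sketch of how an agreement figure like the 58% above can be computed. The sample IDs and class labels are hypothetical, not taken from the actual dataset. Matching the two label sets by sample ID rather than by position guards against a silent ordering mismatch between the expression matrix and the phenodata.

```python
from collections import Counter

def agreement(predicted, published):
    """Return (fraction of matching labels, confusion counts) over shared sample IDs."""
    shared = sorted(set(predicted) & set(published))
    if not shared:
        raise ValueError("no sample IDs in common")
    # Count each (predicted, published) label pair.
    confusion = Counter((predicted[s], published[s]) for s in shared)
    matches = sum(n for (p, q), n in confusion.items() if p == q)
    return matches / len(shared), confusion

# Hypothetical labels keyed by sample ID (illustrative values only).
predicted = {"S1": "C1", "S2": "C2", "S3": "C2", "S4": "C1"}
published = {"S1": "C1", "S2": "C2", "S3": "C1", "S4": "C1"}

rate, table = agreement(predicted, published)
print(rate)
print(dict(table))
```

The confusion counts show whether the disagreements are concentrated in one direction, which is more informative than the overall percentage alone.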

Thank you in advance!

Best regards,
EZ

Owner

Acare commented Aug 27, 2024

Hi, thanks for using the package.

Yes, I think you can't replicate the results of the original paper with the hack_cinsarc() function, because the original analysis is based on a training-validation procedure.

The hack_cinsarc() function implements the LOOCV method from this more recent paper.
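
To illustrate the difference between the two procedures, here is a rough sketch of nearest-centroid classification done both ways. It is not the package's code: the function names, the toy data, and the use of Euclidean distance are all assumptions made for illustration, and the distance actually used by either paper may differ.

```python
import math

def centroid(rows):
    """Gene-wise mean of a list of expression vectors."""
    return [sum(col) / len(rows) for col in zip(*rows)]

def fit_centroids(samples, labels):
    """One centroid per outcome group (e.g. metastasis vs. no metastasis)."""
    groups = {}
    for x, y in zip(samples, labels):
        groups.setdefault(y, []).append(x)
    return {y: centroid(rows) for y, rows in groups.items()}

def nearest(sample, centroids):
    """Label of the closest centroid (Euclidean distance is an assumption)."""
    return min(centroids, key=lambda y: math.dist(sample, centroids[y]))

def train_validation(train_x, train_y, valid_x):
    """Centroids come from a fixed training cohort; validation samples are only classified."""
    cents = fit_centroids(train_x, train_y)
    return [nearest(x, cents) for x in valid_x]

def loocv(samples, labels):
    """Each sample is classified with centroids built from all the other samples."""
    preds = []
    for i, x in enumerate(samples):
        rest_x = samples[:i] + samples[i + 1:]
        rest_y = labels[:i] + labels[i + 1:]
        preds.append(nearest(x, fit_centroids(rest_x, rest_y)))
    return preds

# Toy data: 5 samples x 2 genes, with hypothetical outcome labels.
samples = [[1.0, 1.0], [1.2, 0.8], [3.0, 3.0], [2.8, 3.2], [2.1, 2.1]]
labels = ["no_met", "no_met", "met", "met", "no_met"]

print(loocv(samples, labels))
print(train_validation(samples[:4], labels[:4], samples[4:]))
```

In the training-validation setting the centroids depend on the training cohort, so classes obtained by LOOCV on a single dataset are not expected to match published classes that were derived from a different set of centroids.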

You can run ?hack_cinsarc (or go here) for further info.

Thanks,

Andrea

Acare added the "question" label (Further information is requested) on Aug 27, 2024