INFO: CPU: calc-server INFO: Crux version: 3.1-f2e7488 INFO: Tue Jun 19 18:20:15 MSK 2018 COMMAND LINE: /home/mark/crux/bin/crux percolator /home/mark/overfit_test/comet/confetti_trypsin_01_8.comet.target.pep.xml --decoy-prefix DECOY_ --output-dir /home/mark/overfit_test/percolator_comet_proteins --fileroot confetti_trypsin_01_8 --protein T --overwrite T --fido-empirical-protein-q T --protein-report-duplicates T INFO: Beginning percolator. INFO: Reading file /home/mark/overfit_test/comet/confetti_trypsin_01_8.comet.target.pep.xml ERROR: Decoy file '/home/mark/overfit_test/comet/confetti_trypsin_01_8.comet.decoy.pep.xml' doesn't exist INFO: Converting input to pin format. INFO: Parsing /home/mark/overfit_test/comet/confetti_trypsin_01_8.comet.target.pep.xml INFO: Assigning index 0 to /home/mark/overfit_test/comet/confetti_trypsin_01_8.comet.target.pep.xml. INFO: There are 40952 target matches and 9738 decoys INFO: Maximum observed charge is 6. INFO: File conversion complete. INFO: Percolator version 3.02.0, Build Date Mar 21 2018 17:24:57 INFO: Copyright (c) 2006-9 University of Washington. All rights reserved. INFO: Written by Lukas Käll (lukall@u.washington.edu) in the INFO: Department of Genome Sciences at the University of Washington. INFO: Issued command: INFO: percolator --results-peptides /home/mark/overfit_test/percolator_comet_proteins/confetti_trypsin_01_8.percolator.target.peptides.txt --decoy-results-peptides /home/mark/overfit_test/percolator_comet_proteins/confetti_trypsin_01_8.percolator.decoy.peptides.txt --results-psms /home/mark/overfit_test/percolator_comet_proteins/confetti_trypsin_01_8.percolator.target.psms.txt --decoy-results-psms /home/mark/overfit_test/percolator_comet_proteins/confetti_trypsin_01_8.percolator.decoy.psms.txt --verbose 2 --protein-decoy-pattern DECOY_ --seed 1 --subset-max-train 0 --trainFDR 0.01 --testFDR 0.01 --maxiter 10 --search-input auto --no-schema-validation --protein-enzyme trypsin --protein-report-duplicates --fido-protein --fido-empirical-protein-q --fido-gridsearch-depth 0 --fido-fast-gridsearch 0 --fido-protein-truncation-threshold 0.01 --fido-gridsearch-mse-threshold 0.05 --results-proteins /home/mark/overfit_test/percolator_comet_proteins/confetti_trypsin_01_8.percolator.target.proteins.txt --decoy-results-proteins /home/mark/overfit_test/percolator_comet_proteins/confetti_trypsin_01_8.percolator.decoy.proteins.txt --post-processing-tdc /home/mark/overfit_test/percolator_comet_proteins/confetti_trypsin_01_8.make-pin.pin INFO: Started Tue Jun 19 18:20:18 2018 INFO: Hyperparameters: selectionFdr=0.01, Cpos=0, Cneg=0, maxNiter=10 INFO: Reading tab-delimited input from datafile /home/mark/overfit_test/percolator_comet_proteins/confetti_trypsin_01_8.make-pin.pin INFO: Features: INFO: lnrSp XCorr Sp IonFrac PepLen Charge1 Charge2 Charge3 Charge4 Charge5 Charge6 enzN enzC enzInt lnNumDSP dM absdM INFO: Found 50690 PSMs INFO: Concatenated search input detected and --post-processing-tdc flag set. Applying target-decoy competition on Percolator scores. INFO: Train/test set contains 40952 positives and 9738 negatives, size ratio=4.20538 and pi0=1 INFO: Selecting Cpos by cross-validation. INFO: Selecting Cneg by cross-validation. INFO: Split 1: Selected feature 2 as initial direction. Could separate 17676 training set positives with q<0.01 in that direction. INFO: Split 2: Selected feature 2 as initial direction. Could separate 17532 training set positives with q<0.01 in that direction. INFO: Split 3: Selected feature 2 as initial direction. Could separate 17429 training set positives with q<0.01 in that direction. INFO: Found 26261 test set positives with q<0.01 in initial direction INFO: Reading in data and feature calculation took 2.44392 cpu seconds or 2 seconds wall clock time. INFO: ---Training with Cpos selected by cross validation, Cneg selected by cross validation, initial_fdr=0.01, fdr=0.01 INFO: Iteration 1: Estimated 28988 PSMs with q<0.01 INFO: Iteration 2: Estimated 29720 PSMs with q<0.01 INFO: Iteration 3: Estimated 29911 PSMs with q<0.01 INFO: Iteration 4: Estimated 29945 PSMs with q<0.01 INFO: Iteration 5: Estimated 29971 PSMs with q<0.01 INFO: Iteration 6: Estimated 29970 PSMs with q<0.01 INFO: Iteration 7: Estimated 29955 PSMs with q<0.01 INFO: Iteration 8: Estimated 29952 PSMs with q<0.01 INFO: Iteration 9: Estimated 29959 PSMs with q<0.01 INFO: Iteration 10: Estimated 29958 PSMs with q<0.01 INFO: Learned normalized SVM weights for the 3 cross-validation splits: INFO: Split1 Split2 Split3 FeatureName INFO: -1.3543 -2.2051 -1.4332 lnrSp INFO: 2.9463 2.8785 2.7031 XCorr INFO: -0.1813 -0.1968 -0.0685 Sp INFO: 0.1730 0.1941 0.2415 IonFrac INFO: -0.4275 -0.4056 -0.2960 PepLen INFO: 0.0000 0.0000 0.0000 Charge1 INFO: -0.0341 -0.0412 -0.0574 Charge2 INFO: 0.0120 0.0171 0.0441 Charge3 INFO: 0.0241 0.0331 0.0184 Charge4 INFO: 0.0759 0.0577 0.0380 Charge5 INFO: -0.0027 0.0065 0.0019 Charge6 INFO: 0.0000 0.0000 0.0000 enzN INFO: 0.0000 0.0000 0.0000 enzC INFO: -0.4649 -0.4999 -0.4553 enzInt INFO: -0.1259 -0.1027 -0.1217 lnNumDSP INFO: 0.3813 0.3763 0.3783 dM INFO: -0.2039 -0.2494 -0.2101 absdM INFO: 0.1653 -0.3885 0.0750 m0 INFO: Found 29888 test set PSMs with q<0.01. INFO: Selected best-scoring PSM per scan+expMass (target-decoy competition): 40952 target PSMs and 9738 decoy PSMs. INFO: Tossing out "redundant" PSMs keeping only the best scoring PSM for each unique peptide. INFO: Calculating q values. INFO: Final list yields 21862 target peptides with q<0.01. INFO: Calculating posterior error probabilities (PEPs). INFO: Processing took 71.36 cpu seconds or 40 seconds wall clock time. INFO: INFO: Calculating protein level probabilities. INFO: Copyright (c) 2008-9 University of Washington. All rights reserved. INFO: Written by Oliver R. Serang (orserang@u.washington.edu) in the INFO: Department of Genome Sciences at the University of Washington. INFO: INFO: Initialized protein inference engine. INFO: Computing protein probabilities. INFO: The parameters for the model will be estimated by grid search. INFO: INFO: Estimating the parameters took : 10.1388 cpu seconds or 10 seconds wall time INFO: The following parameters have been chosen: INFO: alpha = 0.008 INFO: beta = 0.001 INFO: gamma = 0.5 INFO: INFO: Protein level probabilities will now be estimated INFO: Computing protein statistics. INFO: protein pi0 estimate = 0.7497801716 INFO: Number of protein groups identified at q-value = 0.01: 3797 INFO: Estimating protein probabilities took : 26.85 cpu seconds or 26 seconds wall clock time. INFO: Elapsed time: 73.3 s INFO: Finished crux percolator. INFO: Return Code:0