<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://bioinfo-core.org/index.php?action=history&amp;feed=atom&amp;title=User_talk%3ABioinfocoreguru</id>
	<title>User talk:Bioinfocoreguru - Revision history</title>
	<link rel="self" type="application/atom+xml" href="http://bioinfo-core.org/index.php?action=history&amp;feed=atom&amp;title=User_talk%3ABioinfocoreguru"/>
	<link rel="alternate" type="text/html" href="http://bioinfo-core.org/index.php?title=User_talk:Bioinfocoreguru&amp;action=history"/>
	<updated>2026-04-05T23:20:00Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.34.0</generator>
	<entry>
		<id>http://bioinfo-core.org/index.php?title=User_talk:Bioinfocoreguru&amp;diff=6962&amp;oldid=prev</id>
		<title>Bioinfocoreguru: Introduction - Bioinfocoreguru</title>
		<link rel="alternate" type="text/html" href="http://bioinfo-core.org/index.php?title=User_talk:Bioinfocoreguru&amp;diff=6962&amp;oldid=prev"/>
		<updated>2009-09-30T21:10:30Z</updated>

		<summary type="html">&lt;p&gt;Introduction - Bioinfocoreguru&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #222; text-align: center;&quot;&gt;Revision as of 21:10, 30 September 2009&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l3&quot; &gt;Line 3:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 3:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;We handle &amp;gt;20 data analysis project/year with a staff of four at the Bioinformatics Analysis Core at the University of Pittsburgh (Genomics and Proteomics Core Laboratories).  Our model is fee-for-service.  The process involves an orientation meeting w/the investigators to discuss study aims and study design.  We provide advice on study design for free.  We derive a cost estimate for the scope of work (data analysis &amp;amp; research services).  The investigator&amp;#039;s project enters our data analysis queue when the data are produced. We maintain separate queues for microarray data analysis, top-down biomarker development (prediction modeling), proteomics (mass-spec, 2D-DIGE, multiplexed panels), and Next Generation Sequencing (GS-FLX and SoLiD).  Our product is a Preliminary Research Report communicated to the PI in person (via PPT).&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;We handle &amp;gt;20 data analysis project/year with a staff of four at the Bioinformatics Analysis Core at the University of Pittsburgh (Genomics and Proteomics Core Laboratories).  Our model is fee-for-service.  The process involves an orientation meeting w/the investigators to discuss study aims and study design.  We provide advice on study design for free.  We derive a cost estimate for the scope of work (data analysis &amp;amp; research services).  The investigator&amp;#039;s project enters our data analysis queue when the data are produced. We maintain separate queues for microarray data analysis, top-down biomarker development (prediction modeling), proteomics (mass-spec, 2D-DIGE, multiplexed panels), and Next Generation Sequencing (GS-FLX and SoLiD).  Our product is a Preliminary Research Report communicated to the PI in person (via PPT).&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;A key component of our core is our focus, for each data stream, on continuous methodological optimization.  We call it &amp;quot;Intelligent Objective Methods Optimization&amp;quot;; we use criteria such a internal consistency for instances where the true status of genes/proteins is unknown (via Efficiency Analysis, (&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;[[&lt;/del&gt;Jordan et al. 2008]&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;]) &lt;/del&gt;a resampling method developed for microarray data normalization + test comparative evaluation).  We used spiked-in sample sets (e.g., protein standards) to explore method space for situation when the true status of a gene/protein is known (e.g., &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;[[&lt;/del&gt;Sultana et al. 2009&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;]&lt;/del&gt;]).  By comparing various methods under objective criteria, we acquire increased confidence in the interpretation.  Key to this is to build the evaluative comparative capabilities into the analysis software resources we build (e.g., &lt;del class=&quot;diffchange diffchange-inline&quot;&gt;[[&lt;/del&gt;Patel and Lyons-Weiler, 2004]&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;])&lt;/del&gt;.&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;A key component of our core is our focus, for each data stream, on continuous methodological optimization.  We call it &amp;quot;Intelligent Objective Methods Optimization&amp;quot;; we use criteria such a internal consistency for instances where the true status of genes/proteins is unknown (via Efficiency Analysis, (Jordan et al. 2008&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;)[1&lt;/ins&gt;] a resampling method developed for microarray data normalization + test comparative evaluation).  We used spiked-in sample sets (e.g., protein standards) to explore method space for situation when the true status of a gene/protein is known (e.g., Sultana et al. 2009 &lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;[2&lt;/ins&gt;]).  By comparing various methods under objective criteria, we acquire increased confidence in the interpretation.  Key to this is to build the evaluative comparative capabilities into the analysis software resources we build (e.g., Patel and Lyons-Weiler, 2004&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;)[3&lt;/ins&gt;].&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;I thought I would share our model; I am grateful to Bioinfo-Core for standing up a valuable community resource and hope to look around and learn a bit.  Our Next Generation Sequencing bioinformatics effort is now well underway, and we hope to report to our community of peers what we are learning about optimal analysis settings for QA/QC optimization in the next year or so.  If anyone would care to point me in the direction of empirical evidence for preferred settings for SoLiD sequence data analysis (specifically optimizing read length parameter during an alignment to reference) and tips on how many mismatches are preferred for various applications we would be grateful!&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;I thought I would share our model; I am grateful to Bioinfo-Core for standing up a valuable community resource and hope to look around and learn a bit.  Our Next Generation Sequencing bioinformatics effort is now well underway, and we hope to report to our community of peers what we are learning about optimal analysis settings for QA/QC optimization in the next year or so.  If anyone would care to point me in the direction of empirical evidence for preferred settings for SoLiD sequence data analysis (specifically optimizing read length parameter during an alignment to reference) and tips on how many mismatches are preferred for various applications we would be grateful!&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l13&quot; &gt;Line 13:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 13:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;= References =&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;= References =&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*[&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;[Jordan et al. 2008]&lt;/del&gt;]:Jordan R, Patel S, Hu H, and Lyons-Weiler J. 2008. Efficiency analysis of competing tests for finding differentially expressed genes in lung adenocarcinoma. Cancer Informatics, 6: 389-421.  http://www.la-press.com/journal.php?journal_id=10&amp;amp;issue_id=101&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*[&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;1&lt;/ins&gt;]:Jordan R, Patel S, Hu H, and Lyons-Weiler J. 2008. Efficiency analysis of competing tests for finding differentially expressed genes in lung adenocarcinoma. Cancer Informatics, 6: 389-421.  http://www.la-press.com/journal.php?journal_id=10&amp;amp;issue_id=101&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*[&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;[Patel and Lyons-Weiler, 2004]&lt;/del&gt;]:Patel, S, J Lyons-Weiler. 2004. caGEDA: A web application for the integrated analysis of global gene expression patterns in cancer. Applied Bioinformatics 3:49-62.&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*[&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;2&lt;/ins&gt;]:Patel, S, J Lyons-Weiler. 2004. caGEDA: A web application for the integrated analysis of global gene expression patterns in cancer. Applied Bioinformatics 3:49-62.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt; &lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&#039;diff-marker&#039;&gt;−&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*[&lt;del class=&quot;diffchange diffchange-inline&quot;&gt;[Sultana et al. 2009]&lt;/del&gt;]:Sultana, T., R. Jordan &amp;amp; J. Lyons-Weiler. 2009. Optimization of the use of consensus methods for the detection and putative identification of peptides via mass-spectrometry using protein standard mixtures. J. of Proteomics &amp;amp; Bioinformatics 2:262-273.&lt;/div&gt;&lt;/td&gt;&lt;td class=&#039;diff-marker&#039;&gt;+&lt;/td&gt;&lt;td style=&quot;color: #222; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;*[&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;3&lt;/ins&gt;]:Sultana, T., R. Jordan &amp;amp; J. Lyons-Weiler. 2009. Optimization of the use of consensus methods for the detection and putative identification of peptides via mass-spectrometry using protein standard mixtures. J. of Proteomics &amp;amp; Bioinformatics 2:262-273.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Bioinfocoreguru</name></author>
		
	</entry>
	<entry>
		<id>http://bioinfo-core.org/index.php?title=User_talk:Bioinfocoreguru&amp;diff=6961&amp;oldid=prev</id>
		<title>Bioinfocoreguru: Introduction - Bioinfocoreguru</title>
		<link rel="alternate" type="text/html" href="http://bioinfo-core.org/index.php?title=User_talk:Bioinfocoreguru&amp;diff=6961&amp;oldid=prev"/>
		<updated>2009-09-30T21:01:32Z</updated>

		<summary type="html">&lt;p&gt;Introduction - Bioinfocoreguru&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;Bioinfo-Core:&lt;br /&gt;
&lt;br /&gt;
We handle &amp;gt;20 data analysis project/year with a staff of four at the Bioinformatics Analysis Core at the University of Pittsburgh (Genomics and Proteomics Core Laboratories).  Our model is fee-for-service.  The process involves an orientation meeting w/the investigators to discuss study aims and study design.  We provide advice on study design for free.  We derive a cost estimate for the scope of work (data analysis &amp;amp; research services).  The investigator&amp;#039;s project enters our data analysis queue when the data are produced. We maintain separate queues for microarray data analysis, top-down biomarker development (prediction modeling), proteomics (mass-spec, 2D-DIGE, multiplexed panels), and Next Generation Sequencing (GS-FLX and SoLiD).  Our product is a Preliminary Research Report communicated to the PI in person (via PPT).&lt;br /&gt;
&lt;br /&gt;
A key component of our core is our focus, for each data stream, on continuous methodological optimization.  We call it &amp;quot;Intelligent Objective Methods Optimization&amp;quot;; we use criteria such a internal consistency for instances where the true status of genes/proteins is unknown (via Efficiency Analysis, ([[Jordan et al. 2008]]) a resampling method developed for microarray data normalization + test comparative evaluation).  We used spiked-in sample sets (e.g., protein standards) to explore method space for situation when the true status of a gene/protein is known (e.g., [[Sultana et al. 2009]]).  By comparing various methods under objective criteria, we acquire increased confidence in the interpretation.  Key to this is to build the evaluative comparative capabilities into the analysis software resources we build (e.g., [[Patel and Lyons-Weiler, 2004]]).&lt;br /&gt;
&lt;br /&gt;
I thought I would share our model; I am grateful to Bioinfo-Core for standing up a valuable community resource and hope to look around and learn a bit.  Our Next Generation Sequencing bioinformatics effort is now well underway, and we hope to report to our community of peers what we are learning about optimal analysis settings for QA/QC optimization in the next year or so.  If anyone would care to point me in the direction of empirical evidence for preferred settings for SoLiD sequence data analysis (specifically optimizing read length parameter during an alignment to reference) and tips on how many mismatches are preferred for various applications we would be grateful!&lt;br /&gt;
&lt;br /&gt;
-jlw&lt;br /&gt;
&lt;br /&gt;
PS [http://www.la-press.com/cancer-informatics-journal-j10 Cancer Informatics] is a fine journal for cancer-related informatics developments and research findings.  I&amp;#039;m a proud papa of the journal, so I&amp;#039;m biased.  No conflicts of interest to report. Enjoy.&lt;br /&gt;
&lt;br /&gt;
= References =&lt;br /&gt;
&lt;br /&gt;
*[[Jordan et al. 2008]]:Jordan R, Patel S, Hu H, and Lyons-Weiler J. 2008. Efficiency analysis of competing tests for finding differentially expressed genes in lung adenocarcinoma. Cancer Informatics, 6: 389-421.  http://www.la-press.com/journal.php?journal_id=10&amp;amp;issue_id=101&lt;br /&gt;
&lt;br /&gt;
*[[Patel and Lyons-Weiler, 2004]]:Patel, S, J Lyons-Weiler. 2004. caGEDA: A web application for the integrated analysis of global gene expression patterns in cancer. Applied Bioinformatics 3:49-62.&lt;br /&gt;
&lt;br /&gt;
*[[Sultana et al. 2009]]:Sultana, T., R. Jordan &amp;amp; J. Lyons-Weiler. 2009. Optimization of the use of consensus methods for the detection and putative identification of peptides via mass-spectrometry using protein standard mixtures. J. of Proteomics &amp;amp; Bioinformatics 2:262-273.&lt;/div&gt;</summary>
		<author><name>Bioinfocoreguru</name></author>
		
	</entry>
</feed>