Home
Browse All
DRL Collection Development Policy
Links to non-OHSU Collections
OHSU Campus Building Names
Digital Collections Blog
Log in
|
Help
Search
Advanced Search
Find results with:
error div
Add another field
Search by date
from
after
before
on
to
Searching collections:
Student Scholar Archive
Add or remove collections
Home
Student Scholar Archive
A data cleaning and annotation framework for genome-wide studies
Reference URL
To link to this object, paste this link in email, IM or document
To embed this object, paste this HTML in website
A data cleaning and annotation framework for genome-wide studies
View Description
Download
small (250x250 max)
medium (500x500 max)
large ( > 500x500)
Full Resolution
13191815112007_200711.ramakrishnan.ranjani.ms.pdf
Description
Rights
http://www.ohsu.edu/xd/education/library/services/theses-dissertations/rights-statement.cfm
Title
A
data
cleaning
and
annotation
framework
for
genome-wide
studies
Creator.PersonalName
Ramakrishnan
,
Ranjani
Thesis.Degree
M.S.
Thesis.Major
Computer Science and Engineering
Thesis.DateDegreeAwarded
November
2007
Institution
Oregon Health & Science University
School
OGI School of Science & Engineering
Department
Dept. of Computer Science and Engineering
Thesis.Advisor/Mentor
McWeeney, Shannon K.
Thesis.Committee
Maier, David
Erdoǧmuş, Deniz
Subject.LCSH
Genomics
Bioinformatics
Subject.Keyword
Data
cleaning
;
Annotations
;
Genome-wide
studies
;
Bioinformatics
;
Biological
data
source
Call Number
Q183.5.OGISE R165 2007
Description.Abstract
Genome-wide
studies
are
sensitive
to the
quality
of
annotation
data
included
for
analyses
and they
often
involve
overlaying
both
computationally
derived
and
experimentally
generated
data
onto a
genomic
scaffold
. A
framework
for
successful
integration
of
data
from
diverse
sources
needs
to
address
, at a
minimum
, the
conceptualization
of the
biological
identity
in the
data
sources
, the
relationship
between
the
sources
in
terms
of the
data
present
, the
independence
of the
sources
and, any
discrepancies
in the
data
. The
outcome
of the
process
should
either
resolve
or
incorporate
these
discrepancies
into
downstream
analyses
. In this
thesis
we
identify
factors
that are
important
in
detecting
errors
within
and
between
sources
and
present
a
generalized
framework
to
detect
discrepancies
. An
implementation
of
our
workflow
is
used
to
demonstrate
the
utility
of the
approach
in the
construction
of a
genome-wide
mouse
transcription
factor
binding
map
and in the
classification
of
Single
nucleotide
polymorphisms
.
We
also
present
the
impact
of these
discrepancies
on
downstream
analyses
. The
framework
is
extensible
and
we
discuss
future
directions
including
summarization
of the
discrepancies
in a
biological
relevant
manner
.
Language
eng
Type
Text
Format.Use
Needs Adobe Acrobat Reader to view.
Format.FileType
pdf
Format.FileSize
537.914 KB
OCLC number
182540403
you wish to report:
Your comment:
Your Name:
Submit
Cancel
...
Back to top
Select the collections to add or remove from your search
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
Y
Z
Select All Collections
B
Beauty of the PNW Collection
C
Campus Collection
Classic Article Collection
Clinical Outcomes Research Initiative (CORI)
CSETech
F
FDA Drug Approval Documents
N
Naturopathic Medicine Historical Collection
O
OHSU Historical Collections & Archives
OHSU Oral History Collection
ONPRC Collection
S
Student Scholar Archive
500
You have selected:
1
OK
Cancel