《生物信息学数据库》PPT课件

上传人:xian****812 文档编号:292933922 上传时间:2022-05-15 格式:PPT 页数:225 大小:13.13MB
返回 下载 相关 举报
《生物信息学数据库》PPT课件_第1页
第1页 / 共225页
《生物信息学数据库》PPT课件_第2页
第2页 / 共225页
《生物信息学数据库》PPT课件_第3页
第3页 / 共225页
《生物信息学数据库》PPT课件_第4页
第4页 / 共225页
《生物信息学数据库》PPT课件_第5页
第5页 / 共225页
点击查看更多>>
资源描述

《《生物信息学数据库》PPT课件》由会员分享,可在线阅读,更多相关《《生物信息学数据库》PPT课件(225页珍藏版)》请在金锄头文库上搜索。

1、DatabasesforBioinformatics陈艳炯医学院免疫与病原生物学系数据库系统基础数据库系统基础数据库的基本概念数据管理系统的发展数据库技术的发展数据库系统的组成数据库应用系统体系结构数据数据(Data)数据的定义数据的定义 描述客观事物描述客观事物( (对象对象) )的符号记录的符号记录数据的种类数据的种类 文字、图形、图像、声音文字、图形、图像、声音 数据的特点数据的特点 数据与其语义是不可分的数据与其语义是不可分的DataThetermdatameansgroupsofinformationthatrepresentthequalitativeorquantitativea

2、ttributesofavariableorsetofvariables.Data(pluralofdatum,whichisseldomused)aretypicallytheresultsofmeasurementsandcanbethebasisofgraphs,images,orobservationsofasetofvariables.Dataareoftenviewedasthelowestlevelofabstractionfromwhichinformationandknowledgearederived.数据概念的变化特点数据概念的变化特点质的规定:由简单到集成;由私有到共享

3、。质的规定:由简单到集成;由私有到共享。量的刻化:由量的刻化:由小量小量到到大量大量到到海量海量。所处位置:在软件中的从属地位到主导地位。所处位置:在软件中的从属地位到主导地位。信息信息(Information) 是以数据为载体的对客是以数据为载体的对客观世界实际存在的事物、事件和概念的抽象观世界实际存在的事物、事件和概念的抽象反应。反应。信息信息=数据数据+数据处理数据处理 DataprocessingComputer data processingisanyprocessthatusesacomputerprogramtoenterdataandsummarise,analyseoroth

4、erwiseconvertdataintousableinformation.Theprocessmaybeautomatedandrunonacomputer.Itinvolvesrecording,analysing,sorting,summarising,calculating,disseminatingandstoringdata.Becausedataaremostusefulwhenwell-presentedandactuallyinformative,data-processingsystemsareoftenreferredtoasinformationsystems.Dat

5、a analysisWhenthedomainfromwhichthedataareharvestedisascienceoranengineering,dataprocessingandinformationsystemsareconsideredtoobroadoftermsandthemorespecializedtermdataanalysisistypicallyused,focusingonthehighly-specializedandhighly-accuratealgorithmicderivationsandstatisticalcalculationsthatareles

6、softenobservedinthetypicalgeneralbusinessenvironment.DataanalysispackageslikeDAP,gretlorPSPPareoftenused.ElementsofdataprocessingInordertobeprocessedbyacomputer,dataneedsfirstbeconvertedintoamachinereadableformat.Oncedataisindigitalformat,variousprocedurescanbeappliedonthedatatogetusefulinformation.

7、Dataprocessingmayinvolvevariousprocesses,including:Dataacquisition(数据采集)Dataentry(数据录入)Datacleaning(数据清理)Datavalidation(数据验证)Datatabulation(数据制表)Statisticalanalysis(统计分析)Computergraphics(计算机图形)Datawarehousing(数据存储)Datamining(数据挖掘)DataacquisitionIncomputerdataprocessing,data acquisitionisthesamplingo

8、frealworldphysicalconditionsandconversionoftheresultingsamplesintodigitalnumericvaluesthatcanbemanipulatedbyacomputer.Thecomponentsofdataacquisitionsystemsinclude:Sensorsthatconvertphysicalparameterstoelectricalsignals.Signalconditioningcircuitrytocoercesensorsignalsintoaformthatcanbeconvertedtodigi

9、talvalues.Analog-to-digitalconverters,whichconvertconditionedsensorsignalstodigitalvalues.Dependingontheapplication,acquireddatamaybedisplayed,analyzed,orrecorded,orsomecombinationthereof.DataacquisitionapplicationsmaybecontrolledbycommercialDAQsoftwareorbycustomprogramsdevelopedusingvariousgeneralp

10、urposeprogramminglanguagessuchasBASICorC.SpecializedprogramminglanguagesusedfordataacquisitionincludeEPICSforbuildinglargescaledataacquisitionsystems,LabVIEW,whichoffersagraphicalprogrammingenvironment,andMATLABwhichprovidesgraphicaltoolsandlibrariesfordataacquisitionandanalysis.Data cleansingordata

11、 scrubbingistheactofdetectingandcorrecting(orremoving)corruptorinaccuraterecordsfromarecordset,table,ordatabase.Usedmainlyindatabases,thetermreferstoidentifyingincomplete,incorrect,inaccurate,irrelevantetc.partsofthedataandthenreplacing,modifyingordeletingthisdirty data.Aftercleansing,adatasetwillbe

12、consistentwithothersimilardatasetsinthesystem.Theinconsistenciesdetectedorremovedmayhavebeenoriginallycausedbydifferentdatadictionarydefinitionsofsimilarentitiesindifferentstores,mayhavebeencausedbyuserentryerrors,ormayhavebeencorruptedintransmissionorstorage.Datacleansingdiffersfromdatavalidationin

13、thatvalidationalmostinvariablymeansdataisrejectedfromthesystematentryandisperformedatentrytime,ratherthanonbatchesofdata.Theactualprocessofdatacleansingmayinvolveremovingtypographicalerrorsorvalidatingandcorrectingvaluesagainstaknownlistofentities.Thevalidationmaybestrict(suchasrejectinganyaddressth

14、atdoesnothaveavalidpostalcode)orfuzzy(suchascorrectingrecordsthatpartiallymatchexisting,knownrecords).Adata entry clerkisamemberofstaffwhoreadshand-writtenorprintedrecordsandtypesthemintoacomputer.Theyaresometimesemployedonatemporarybasis,butmostlargecompanieswhichhavelargeamountsofdatawillhireonane

15、ar-permanentbasis.Incomputerscience,data validationistheprocessofensuringthataprogramoperatesonclean,correctandusefuldata.Itusesroutines,oftencalledvalidationrulesorcheckroutines,thatcheckforcorrectness,meaningfulness,andsecurityofdatathatareinputtothesystem.Therulesmaybeimplementedthroughtheautomat

16、edfacilitiesofadatadictionary,orbytheinclusionofexplicitapplicationprogramvalidationlogic.Incorrectdatavalidationcanleadtodatacorruptionorasecurityvulnerability.Datavalidationchecksthatdataarevalid,sensible,reasonable,andsecurebeforetheyareprocessed.Computer graphicsaregraphicscreatedusingcomputersand,moregenerally,therepresentationandmanipulationofpictorialdatabyacomputer.Thedevelopmentofcomputergraphics,orsimplyreferredtoasCG,hasmadecomputerseasiertointeractwith,andbetterforunderstandingandint

展开阅读全文
相关资源
相关搜索

当前位置:首页 > 中学教育 > 教学课件 > 高中课件

电脑版 |金锄头文库版权所有
经营许可证:蜀ICP备13022795号 | 川公网安备 51140202000112号