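Information Retrieval: Algorithms and Heuristics (English Edition, 2nd Edition), original English-language book, by Grossman (US) and Frieder (US), genuine copy from Xinhua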

Original English-language book. Brand-new genuine copy from Xinhua Bookstore; 7-day no-questions-asked returns supported.

Author: Grossman (US), Frieder (US)
Publisher: Posts & Telecom Press (人民邮电出版社)
ISBN: 9787115212252
Publication date: 2009-10

Edition: 1
Binding: Paperback
Format: 16开 (16-kai)


Sale price: ¥38.70 (66% of list price)

List price: ¥59.00

Condition: new


Listed on: 2023-06-01

Quantity: only 1 copy available


Category:

Computers and Internet

Item no.:

xhwx_11250236

Condition description: new

Genuine new book at a special price

Product description:

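Editor's recommendation:

With the rise of search engine companies such as Google and Baidu, information retrieval has become an exciting and popular research field. This book describes ad hoc information retrieval from a developmental perspective, discusses the latest algorithms for large-scale data retrieval, covers inference networks and system efficiency in detail, and gives a detailed, workable example for each method. It also integrates techniques for processing structured and unstructured data, which other textbooks do not. The second edition adds IR language models and cross-language retrieval, and discusses many current hot topics, such as XML, P2P information retrieval, duplicate document detection, parallel document clustering, fusion of different retrieval strategies, and information mediators. The book balances breadth and depth, tracks the latest developments, and is a well-known work in the field of information retrieval. It has been adopted as a textbook by many well-known universities (such as Princeton University and Rutgers University in the United States).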

Table of contents:

1. introduction
2. retrieval strategies
  2.1 vector space model
  2.2 probabilistic retrieval strategies
  2.3 language models
  2.4 inference networks
  2.5 extended boolean retrieval
  2.6 latent semantic indexing
  2.7 neural networks
  2.8 genetic algorithms
  2.9 fuzzy set retrieval
  2.10 summary
  2.11 exercises
3. retrieval utilities
  3.1 relevance feedback
  3.2 clustering
  3.3 passage-based retrieval
  3.4 n-grams
  3.5 regression analysis
  3.6 thesauri
  3.7 semantic networks
  3.8 parsing
  3.9 summary
  3.10 exercises
4. cross-language information retrieval
  4.1 introduction
  4.2 crossing the language barrier
  4.3 cross-language retrieval strategies
  4.4 cross language utilities
  4.5 summary
  4.6 exercises
5. efficiency
  5.1 inverted index
  5.2 query processing
  5.3 signature files
  5.4 duplicate document detection
  5.5 summary
  5.6 exercises
6. integrating structured data and text
  6.1 review of the relational model
  6.2 a historical progression
  6.3 information retrieval as a relational application
  6.4 semi-structured search using a relational schema
  6.5 multi-dimensional data model
  6.6 mediators
  6.7 summary
  6.8 exercises
7. parallel information retrieval
  7.1 parallel text scanning
  7.2 parallel indexing
  7.3 clustering and classification
  7.4 large parallel systems
  7.5 summary
  7.6 exercises
8. distributed information retrieval
  8.1 a theoretical model of distributed retrieval
  8.2 web search
  8.3 result fusion
  8.4 peer-to-peer information systems
  8.5 other architectures
  8.6 summary
  8.7 exercises
9. summary and future directions
references
index

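Synopsis:

This book is an excellent textbook for an information retrieval course. It covers the concepts, methods, and algorithms of information retrieval in detail, including retrieval strategies, retrieval utilities, cross-language information retrieval, query processing, integration of structured data and text, parallel information retrieval, and distributed information retrieval, with many examples that illustrate the algorithms. The book offers both depth and breadth, and all material is presented in terms of current technology. It is an ideal textbook for undergraduate and graduate students in computer science, information management, and related majors, and a good reference for researchers and practitioners in the field of information retrieval.

About the author:

David A. Grossman holds a PhD from George Mason University and currently teaches in the Department of Computer Science at Illinois Institute of Technology. He previously served as a project manager at a US government technical services center and office of research and development. His main research areas include information retrieval, the integration of structured and unstructured data, and data mining.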

Excerpt:

    3.4.1 D'Amore and Mah
    Initial information retrieval research focused on n-grams as presented in [D'Amore and Mah, 1985]. The motivation behind their work was the fact that it is difficult to develop mathematical models for terms since the potential for a term that has not been seen before is infinite. With n-grams, only a fixed number of n-grams can exist for a given value of n. A mathematical model was developed to estimate the noise in indexing and to determine appropriate document similarity measures.
    D'Amore and Mah's method replaces terms with n-grams in the vector space model. The only remaining issue is computing the weights for each n-gram. Instead of simply using n-gram frequencies, a scaling method is used to normalize the length of the document. D'Amore and Mah's contention was that a large document contains more n-grams than a small document, so it should be scaled based on its length.
    To compute the weights for a given n-gram, D'Amore and Mah estimated the number of occurrences of an n-gram in a document. The first simplifying assumption was that n-grams occur with equal likelihood and follow a binomial distribution. Hence, it was no more likely for n-gram "abc" to occur than "dee". The Zipfian distribution that is widely accepted for terms is not true for n-grams. D'Amore and Mah noted that n-grams are not equally likely to occur, but the removal of frequently occurring terms from the document collection resulted in n-grams that follow a more binomial distribution than the terms.
    D'Amore and Mah computed the expected number of occurrences of an n-gram in a particular document. This is the product of the number of n-grams in the document (the document length) and the probability that the n-gram occurs. The n-gram's probability of occurrence is computed as the ratio of its number of occurrences to the total number of n-grams in the document. D'Amore and Mah continued their application of the bino
    ……
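
The expected-count computation described in the excerpt can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names `char_ngrams` and `expected_counts`, the use of character trigrams, and the sample documents are all hypothetical. It also assumes the n-gram probability is estimated over the whole collection; the excerpt's wording says "in the document", which would make the expected count equal the observed count.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Return all overlapping character n-grams of text."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def expected_counts(docs, n=3):
    """For each document, map each of its n-grams to an expected count.

    expected count = (number of n-grams in the document)
                     * (probability that the n-gram occurs)
    Assumption: the probability is the n-gram's share of all n-grams
    in the collection, not in the single document.
    """
    grams_per_doc = [char_ngrams(d, n) for d in docs]
    collection = Counter(g for grams in grams_per_doc for g in grams)
    total = sum(collection.values())
    result = []
    for grams in grams_per_doc:
        length = len(grams)  # document length measured in n-grams
        result.append({g: length * collection[g] / total for g in set(grams)})
    return result


docs = ["abcabc", "abcd"]  # hypothetical sample collection
expected = expected_counts(docs)
```

Comparing these expected counts with the observed counts per document is one way to see how length scaling affects the weight of an n-gram.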

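Reviews:

"This book covers the latest research results, its language stands up to scrutiny, and it includes a large number of carefully prepared examples. It is well suited as a first-choice textbook for graduate and undergraduate information retrieval courses."
- W. Bruce Croft, Distinguished Professor, Department of Computer Science, University of Massachusetts Amherst

"Use this book as a first-choice textbook for computer science students. It is also suitable for SEO professionals and web developers, who can apply search techniques, algorithms, and heuristics to their projects."
- Dr. E. Garcia, information technology and services consultant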