文件名称:
Packtpub.Python.2.6.Text.Processing.Beginners.Guide.Dec.2010
开发工具:
文件大小: 7mb
下载次数: 0
上传时间: 2011-02-11
详细说明: Book Description For programmers, working with text is not about reading their newspaper on a break; it’s about taking textual data in one form and doing something to it. Extract, decrypt, parse, restructure – these are just some of the text tasks that can occupy much of a programmer’s life. If this is your life, this book will make it better – a practical guide on how to do what you want with textual data in Python. Python 2.6 Text Processing Beginner’s Guide is the easiest way to learn how to manipulate text with Python. Pack ed with examples, it will teach you text processing techniques and give you the skills to work with the most popular Python libraries for transforming text from one form to another. The book gets you going with a quick look at some data formats, and installing the supporting libraries and components so that you’re ready to get started. You move on to extracting text from a collection of sources and handling it using Python’s built-in string functions and regular expressions. You look into processing structured text documents such as XML and HTML, JSON, and CSV. Then you progress to generating documents and creating templates. Finally you look at ways to enhance text output via a collection of third-party packages such as Nucular, PyParsing, NLTK, and Mako. Learn text processing techniques and work with the most popular Python libraries for transforming text from one form to another. What you will learn from this book : Know the options available for processing text in Python Parse JSON data that is often used as a data delivery mechanism on the Internet Organize a log-processing application via modules and packages to make it more extensible Perform conditional matches via look-ahead and look-behind assertions by using basic regular expressions Process XML and HTML documents in a variety of ways based on the needs of your application Implement callback methods to perform SAX processing and walk in-memory DOM structures Understand Unicode, character encoding, internationalization, and localization Lay out a Mako template-based project by using techniques such as template inheritance, additional tags, and custom filters Install and use the Mako templating system to create your own Mako templates Process a large number of e-mail messages using the Python standard library and index them with Nucular for fast searching Fix common exceptions that occur while dealing with different types of text encoding Build simple PDF output using the ReportLab toolkit’s high-level PLATYPUS framework Generate Microsoft Excel output using the xlwt module Open and edit existing Open Document files to use them as template sources Understand supporting functions and classes, such as the Python IO system and packaging components Approach This book is part of the Beginner’s Guide series. Each chapter covers the steps for various tasks to process data followed by brief explanation of what is happening in each task. The explanation is followed by a few questions on the topic under discussion that will serve as a refresher course for you. Who this book is written for This book is for people who have text in one format, and need it in another, as quickly as possible. You don’t need any experience with text processing, but you will need some basic knowledge of Python. ...展开收缩
(系统自动生成,下载前可以参看下载内容)
下载文件列表
相关说明
- 本站资源为会员上传分享交流与学习,如有侵犯您的权益,请联系我们删除.
- 本站是交换下载平台,提供交流渠道,下载内容来自于网络,除下载问题外,其它问题请自行百度。
- 本站已设置防盗链,请勿用迅雷、QQ旋风等多线程下载软件下载资源,下载后用WinRAR最新版进行解压.
- 如果您发现内容无法下载,请稍后再次尝试;或者到消费记录里找到下载记录反馈给我们.
- 下载后发现下载的内容跟说明不相乎,请到消费记录里找到下载记录反馈给我们,经确认后退回积分.
- 如下载前有疑问,可以通过点击"提供者"的名字,查看对方的联系方式,联系对方咨询.