Experience Seamless Full Ebook Downloads for Every Genre at textbookfull.
com
Data Science on AWS Implementing End to End
Continuous AI and Machine Learning Pipelines Early
Edition Chris Fregly
https://textbookfull.com/product/data-science-on-aws-
implementing-end-to-end-continuous-ai-and-machine-learning-
pipelines-early-edition-chris-fregly/
OR CLICK BUTTON
DOWNLOAD NOW
Explore and download more ebook at https://textbookfull.com
Recommended digital products (PDF, EPUB, MOBI) that
you can download immediately if you are interested.
Applied Data Science Using PySpark: Learn the End-to-End
Predictive Model-Building Cycle 1st Edition Ramcharan
Kakarla
https://textbookfull.com/product/applied-data-science-using-pyspark-
learn-the-end-to-end-predictive-model-building-cycle-1st-edition-
ramcharan-kakarla/
textboxfull.com
AI as a Service: Serverless machine learning with AWS 1st
Edition Peter Elger
https://textbookfull.com/product/ai-as-a-service-serverless-machine-
learning-with-aws-1st-edition-peter-elger/
textboxfull.com
Data Engineering with AWS: Acquire the skills to design
and build AWS-based data transformation pipelines like a
pro 2nd Edition Eagar
https://textbookfull.com/product/data-engineering-with-aws-acquire-
the-skills-to-design-and-build-aws-based-data-transformation-
pipelines-like-a-pro-2nd-edition-eagar/
textboxfull.com
Routines for Results A Quick Reference Guidebook of End to
End Solutions to Solidify Your Small Business 1st Edition
Chris Hook
https://textbookfull.com/product/routines-for-results-a-quick-
reference-guidebook-of-end-to-end-solutions-to-solidify-your-small-
business-1st-edition-chris-hook/
textboxfull.com
Machine Learning Q and AI: 30 Essential Questions and
Answers on Machine Learning and AI 1 / converted Edition
Sebastian Raschka
https://textbookfull.com/product/machine-learning-q-and-
ai-30-essential-questions-and-answers-on-machine-learning-and-
ai-1-converted-edition-sebastian-raschka/
textboxfull.com
Vue on Rails: End-to-End Guide to Building Web Apps Using
Vue.js and Rails Bryan Lim
https://textbookfull.com/product/vue-on-rails-end-to-end-guide-to-
building-web-apps-using-vue-js-and-rails-bryan-lim/
textboxfull.com
Making Sense of Sensors: End-to-End Algorithms and
Infrastructure Design from Wearable-Devices to Data Center
1st Edition Omesh Tickoo
https://textbookfull.com/product/making-sense-of-sensors-end-to-end-
algorithms-and-infrastructure-design-from-wearable-devices-to-data-
center-1st-edition-omesh-tickoo/
textboxfull.com
AI as a Service Serverless machine learning with AWS 1st
Edition Peter Elger Eoin Shanaghy Shanaghy Eóin
https://textbookfull.com/product/ai-as-a-service-serverless-machine-
learning-with-aws-1st-edition-peter-elger-eoin-shanaghy-shanaghy-eoin/
textboxfull.com
AWS Automation Cookbook Continuous Integration and
Continuous Deployment using AWS services 1st Edition Nikit
Swaraj
https://textbookfull.com/product/aws-automation-cookbook-continuous-
integration-and-continuous-deployment-using-aws-services-1st-edition-
nikit-swaraj/
textboxfull.com
Data Science on AWS
Implementing End-to-End, Continuous AI and
Machine Learning Pipelines
With Early Release ebooks, you get books in their earliest form—
the author’s raw and unedited content as they write—so you can
take advantage of these technologies long before the official
release of these titles.
Chris Fregly and Antje Barth
Data Science on AWS
by Chris Fregly and Antje Barth
Copyright © 2021 Antje Barth and Flux Capacitor, LLC. All rights
reserved.
Printed in the United States of America.
Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North,
Sebastopol, CA 95472.
O’Reilly books may be purchased for educational, business, or sales
promotional use. Online editions are also available for most titles
(http://oreilly.com). For more information, contact our
corporate/institutional sales department: 800-998-9938 or
[email protected].
Acquisitions Editor: Jessica Haberman
Development Editor: Gary O’Brien
Production Editor: Katherine Tozer
Interior Designer: David Futato
Cover Designer: Karen Montgomery
July 2021: First Edition
Revision History for the Early Release
2020-05-05: First Release
2020-06-10: Second Release
2020-07-17: Third Release
2020-08-03: Fourth Release
2020-08-26: Fifth Release
2020-10-02: Sixth Release
2020-11-30: Seventh Release
See http://oreilly.com/catalog/errata.csp?isbn=9781492079392 for
release details.
The O’Reilly logo is a registered trademark of O’Reilly Media, Inc.
Data Science on AWS, the cover image, and related trade dress are
trademarks of O’Reilly Media, Inc.
The views expressed in this work are those of the authors, and do
not represent the publisher’s views. While the publisher and the
authors have used good faith efforts to ensure that the information
and instructions contained in this work are accurate, the publisher
and the authors disclaim all responsibility for errors or omissions,
including without limitation responsibility for damages resulting from
the use of or reliance on this work. Use of the information and
instructions contained in this work is at your own risk. If any code
samples or other technology this work contains or describes is
subject to open source licenses or the intellectual property rights of
others, it is your responsibility to ensure that your use thereof
complies with such licenses and/or rights.
978-1-492-07932-3
Chapter 1. Automated
Machine Learning
A NOTE FOR EARLY RELEASE READERS
With Early Release ebooks, you get books in their earliest form—
the author’s raw and unedited content as they write—so you can
take advantage of these technologies long before the official
release of these titles.
This will be the 3rd chapter of the final book. Please note that
the GitHub repo will be made active later on.
If you have comments about how we might improve the content
and/or examples in this book, or if you notice missing material
within this chapter, please reach out to the author at
[email protected].
In this chapter, we will show how to use the fully-managed Amazon
AI and machine learning services to offload the undifferentiated
heavy lifting of building AI pipelines. We dive deep into two Amazon
services for automated machine learning, Amazon SageMaker
Autopilot and Amazon Comprehend, designed for users who want to
build powerful predictive models from their datasets with just a few
clicks.
What is Automated Machine Learning?
Automated machine learning (AutoML) commonly refers to the effort
of automating the typical steps of a machine learning pipeline shown
in Figure 1-1.
Figure 1-1. Typical machine learning pipeline.
Machine learning practitioners spend a lot of time building and
managing such pipelines. They need to prepare the data and decide
on the framework and algorithm to use. Seasoned data scientists
use years of experience and intuition to choose the best algorithm
for a given dataset. In an iterative process, ML practitioners try to
find the best performing model configuration called “hyper-
parameters”. Unfortunately, there is no cheat sheet either for
choosing any of these parameters. We still need experience,
intuition, and patience to run many experiments and find the best
hyper-parameters for our algorithm and dataset.
What if we could just use a service that automatically finds and
trains the best model for our dataset and deploys the model to
production with a single click? Amazon SageMaker Autopilot offers
us exactly this functionality. Autopilot simplifies the model training
and tuning processing by handling many aspects of the model
development life cycle (MDLC) including feature transformation,
algorithm selection, model training, tuning, and deployment.
Simply point Autopilot to your dataset - and out comes a set of fully-
trained and optimized predictive models. Autopilot explores many
algorithms and configurations based on many years of AI and
machine learning experience at Amazon. The model candidates are
summarized by Autopilot through a set of generated Jupyter
notebooks and Python scripts. You have full control over these
generated notebooks and scripts. You can modify them, automate
them, and share them with colleagues. You can select the top model
candidate based on your desired balance of model accuracy, model
size, and prediction latency. Let’s dive deeper into the process of
automated machine learning with Autopilot.
Automated Machine Learning with Autopilot
Autopilot is the name of Amazon SageMaker’s AutoML service. You
simply provide your raw data in a S3 bucket, for example in the form
of a tabular CSV file, and tell Autopilot which column you want to
predict. As the name implies, Autopilot then does the rest
automatically.
NOTE
S3 is Amazon’s Simple Storage Service. S3 provides a simple web
service interface that you can use to store and retrieve any amount of
data. We will discuss this service in more detail in the next chapter.
Autopilot uses automated machine learning to analyze the data and
identifies the best algorithm and model configuration for your data
as shown in Figure 1-2.
Figure 1-2. Amazon SageMaker Autopilot.
You can tell Autopilot how many model candidates to explore. In the
process of building those candidates, Autopilot tries different
algorithms and algorithm settings. Autopilot also applies all needed
data transformations to your data to optimize the input for each
algorithm. The algorithm, configuration, and data transformation
code are then combined into a single ML pipeline definition. The
most promising pipelines are selected by Autopilot and used to find
the best performing model. Lastly, Autopilot shares the results in a
model leaderboard. You can use the best performing model as a
baseline and optimize the model even further. A second option is to
simply deploy the model and start predicting.
Another highlight of Autopilot is the fact that it provides full visibility
into each of those steps and shares all code needed to reproduce
the results. AWS calls this a “white-box” approach. This white-box
approach to AutoML is very unique. Let’s explore the white-box vs.
black-box approach to AutoML a bit further.
Understand Autopilot’s White-Box Approach to
AutoML
In a black-box approach as shown in Figure 1-3, you don’t have
control or visibility into the chosen algorithms, applied data
transformations, or hyper-parameter choices. You point the AutoML
service to your data and receive a trained model.
Figure 1-3. Other AutoML products use a black-box approach.
This makes it hard to understand and explain the model, and to
manually reproduce the model. Most AutoML solutions implement
this kind of black-box approach. In contrast, Autopilot documents
and shares its findings throughout the data analysis, feature
engineering, and model tuning steps, as shown in Figure 1-4.
Figure 1-4. Autopilot’s white-box approach to AutoML.
Autopilot not only shares the models, Autopilot also logs all observed
metrics and generates Jupyter notebooks which contain the code to
reproduce the model pipelines. The Data Exploration notebook
contains the data analysis results and identifies potential data quality
issues such as missing values that might impact model performance
if not addressed. The Candidate Definition notebook highlights the
best algorithms to learn our given dataset, as well as the code and
configuration needed to use your dataset with each algorithm.
NOTE
The Jupyter notebooks are available after the first data analysis step.
You can configure Autopilot to stop after this step if you want to iterate
quickly over your data before starting the actual model tuning step.
Use SageMaker Experiments in Autopilot
Autopilot uses SageMaker Experiments to keep track of all data
analysis, feature engineering and model training/tuning jobs. This
feature of the broader Amazon SageMaker family of ML services
helps you organize, track, compare and evaluate machine learning
experiments. SageMaker Experiments enables model versioning and
lineage tracking across all phases of the ML lifecycle.
An experiment consists of trials and training jobs as shown in
Figure 1-5. A trial is a collection of training steps and metadata for
those steps. The training steps typically include data preprocessing,
model training, and model tuning. Metadata includes dataset
locations, algorithm hyper-parameters, output artifacts, and
performance metrics.
Figure 1-5. SageMaker Experiments.
You can explore and manage Autopilot experiments and trials either
through the UI or using the AWS Python SDK boto3
[https://github.com/boto/boto3]. Let’s have a look at both options
and start an Autopilot experiment to build a custom classifier model.
As input data, we leverage samples from the Amazon Customer
Reviews Dataset [https://s3.amazonaws.com/amazon-reviews-
pds/readme.html]. This dataset is a collection of over 150 million
product reviews on Amazon.com from 1995 to 2015. Those product
reviews and star ratings are a popular customer feature of
Amazon.com. Star rating 5 is the best and 1 is the worst. We will
describe and explore this dataset in much more detail in the next
chapters.
Train and Deploy with Autopilot UI
The Autopilot UI is integrated into SageMaker Studio, an IDE which
provides a single, web-based visual interface where you can perform
your machine learning development. Simply navigate to Amazon
SageMaker in your AWS Console and click on SageMaker Studio as
shown in Figure 1-6.
Figure 1-6. AWS Console > Amazon SageMaker > Amazon SageMaker Studio
Then, follow the instructions to set up SageMaker Studio and click
on Open Studio once completed as shown in Figure 1-7.
Figure 1-7. Setup SageMaker Studio
This will take you to the SageMaker Studio IDE as shown in
Figure 1-8.
Figure 1-8. SageMaker Studio IDE
You will find the Autopilot UI when you click the laboratory flask icon
in the left-side menu as shown in Figure 1-9. Once there, click on
Create experiment to create and configure our first Autopilot
experiment.
Figure 1-9. Create an Autopilot experiment.
In preparation for our Autopilot experiment, we use a subset of the
Amazon Customer Reviews Dataset to train our model. We want to
train a classifier to predict the star rating for a given review,
therefore we only use the star_rating and review_body
columns:
star_rating,review_body
5,"GOOD, GREAT, WONDERFUL, MORE THAN ADEQUATE AND EXACTLY
WHAT I NEED FOR MY COMPUTER. NONE BETTER OR GOOD TO USE."
2,"Even though it does the same job as TurboTax, it isn't as
user friendly as TurboTax. I guess you get what you paid
for."
4,"Pretty easy to use. No issues."
…
NOTE
In other scenarios, you will likely want to use more columns from your
dataset and let Autopilot choose the most important ones. In our
example, however, we want to keep things simple and just use the
star_rating and review_body columns to focus on the technical aspects
of Autopilot.
Next, we configure the Autopilot experiment as shown in Figure 1-
10.
Figure 1-10. Configure the Autopilot Experiment.
We just need to provide a few input parameters:
Experiment name: A name to identify the experiment, e.g.
amazon-customer-reviews
Input data location: The S3 path to our training data, e.g.
s3://dsoaws-amazon-
reviews/data/amazon_reviews_us_Digital_Softwa
re_v1_00_header.csv
Target attribute name: The attribute (column) we want to
predict, i.e. star_rating
Output data location: The S3 path for storing the generated
output, such as models and other artifacts, e.g.
s3://dsoaws-amazon-reviews/autopilot/output
Problem type: The machine learning problem type such as
Binary classification, Regression, and Multiclass classification.
The default, “Auto”, allows Autopilot to choose for itself
based on the given input data.
Run complete experiment: We can choose to run a complete
experiment or just generate the Data Exploration and
Candidate Definition notebooks as part of the Data Analysis
phase.
Let’s click Create Experiment and start our first Autopilot job.
You can observe the progress of the job in the UI as shown in
Figure 1-11.
Figure 1-11. Progress of a running Autopilot job.
The UI shows the 3 stages of the Autopilot job: Analyzing Data,
Feature Engineering, and Model Tuning. Once Autopilot completes
the Analyzing Data stage, you can see the links to the two generated
notebooks appearing in the UI as shown in Figure 1-12.
Figure 1-12. Autopilot > Analyzing Data > Generated notebooks
If you have a look at the S3 output bucket, you can find the
generated notebooks, code, and transformed data in the following
structure:
amazon-customer-reviews/
data-processor-models/
amazon-cus-dpp0-1-xxx/
output/model.tar.gz
amazon-cus-dpp1-1-xxx/
output/model.tar.gz
...
preprocessed-data/
header/
headers.csv
tuning_data/
train/
chunk_20.csv
chunk_21.csv
...
validation/
chunk_0.csv
chunk_1.csv
...
sagemaker-automl-candidates/
pr-1-
a1cd024ace184d2fa646384b014a98f3a13ef4fa0cd24f4ebe555ea005/
generated_module/
candidate_data_processors/
dpp0.py
dpp1.py
...
notebooks/
SageMakerAutopilotCandidateDefinitionNotebook.ipynb
SageMakerAutopilotDataExplorationNotebook.ipynb
...
When the Feature Engineering stage starts, you will see SageMaker
training jobs appearing in the AWS Console as shown in Figure 1-13.
Figure 1-13. Autopilot Feature Engineering > Performed By SageMaker Training
Jobs
Each training job is a combination of a model candidate and the data
preprocessor (dpp) code, named dpp0 through dpp9. You can think
of those training jobs as the ten machine learning pipelines Autopilot
built to find the best performing model. You can select any of those
training jobs to view the job status, configuration, parameters, and
log files.
Once the Feature Engineering stage has completed, we can view the
transformed data directly in S3 grouped by pipeline. The data has
been divided into smaller chunks and split into separate train and
validation datasets which we will cover in more detail in the data
preparation chapter.
transformed-data/
dpp0/
rpb/
train/
chunk_20.csv_out
chunk_21.csv_out
...
validation/
chunk_0.csv_out
chunk_1.csv_out
...
dpp1/
csv/
train/
chunk_20.csv_out
chunk_21.csv_out
...
validation/
chunk_0.csv_out
chunk_1.csv_out
...
..
dpp9/
Finally, Autopilot runs the Model Tuning stage as shown in Figure 1-
14. Now, we can actually see the trials appearing in the Autopilot UI.
Another Random Scribd Document
with Unrelated Content
Amblyoponides, 132, 180
Ambrosia, 295
Ambryllis, 409
Amicta quadrangularis, 394
Ammophila, 111;
A. affinis, 111;
A. hirsuta, 111;
A. holosericea, 111
Amphicyrtides, 242
Amphidasis betularia, 412, 414
Amphiodont, 193
Amphipneustic, 450
Amphizoa lecontei, 207
Amphizoidae, 207
Ampulex compressa, 114, 115;
A. ruficornis, 115;
A. sibirica, 114
Ampulicides, 114 f., 169
Amycterides, 291
Anal armature, 328, 416
Anal nervures, 318
Anaphe, 376
Anaspini, 267
Anaspis, 268
Anatomy—see External Structure and Internal Anatomy
Anchor-process, 459
Ancient, Lepidopteron, 435—see also Primitive
Ancylolominae, 425
Andrena, 23, 25, 30, 301, 303;
hair of, 11;
A. labialis, 488;
A. nigroaenea, 23;
A. ovina, 30
Andrenidae, 20
Andrenides, 23
Andrenimorpha, 388
Andrenoides, 20
Androconia, 331 f.
Anergates atratulus, 160 f.
Angelitos, 63
Anisopteryx aescularia, 411
Anisotoma cinnamomea, 222
Anisotomidae, 223
Anisotomides, 222
Anlagen, 143
Anobiides, 247
Anobium, 254;
A. paniceum, 247;
A. striatum, 248;
A. tessellatum, 248
Anochetus ghiliani, 174
Anomma, 178
Anophthalmus, 205
Anoplura, 599 f.
Anosia erippus, A. menippe, 345;
A. plexippus, 307;
larva, 324;
pupa, 327
Antennae;
of Belostoma, 566;
of flies, 441;
of Lamellicornia, 191;
of Lepidoptera, 307;
of butterflies, 340, 341;
of Sphingidae, 380
Antherophagus, 235
Anthicidae, 266
Anthidium, 45;
A. bellicosum, 47;
A. diadema, 45;
A. manicatum, 45;
A. septemdentatum, 47;
A. strigatum, 30
Anthocopa, 51
Anthocoridae, 560
Anthomyia angustifrons, A. brassicae, A. cana, 506
Anthomyiidae, 506
Anthophila, 10 f.
Anthophora, 32, 33;
destroyer of, 272, 274;
A. personata, 33;
A. pilipes, 33;
proboscis of, 17
Anthothrips aculeata, 530
Anthracides, 486
Anthrax, 486 f.;
A. fenestralis, 489;
A. trifasciata, 44
Anthrenus fasciatus, 241
Anthribidae, 278, 290
Antisquama, 448
Antitegula, 447
Ant-plant, 138, 139, 168
Ants, 131 f.;
and Aphidae, 590;
and caterpillars, 356
Ants'-nest, Insects, 200, 213, 221, 223, 224, 225, 231, 236, 240,
548;
larva, 501, 502
Anus, 314, 320
Aorta, 320
Aortal chamber, 320
Apate capucina, 246
Apatela, 418
Apathus = Psithyrus, q.v.
Apatidae, 246
Apatura;
larvae, 354;
A. iris, 344
Apaturides, 352
Aphaenogaster, 164, 165, 221;
A. arenarius, 164;
A. barbarus, 131, 164;
A. structor 164, 240
Aphaniptera, 522 f.
Aphanocephalus, 228
Aphelocheirus, 565
Aphidae, 581 f.;
and ants, 181
Aphidiides, 590
Aphis maidi-radicis, 584
Aphomia sociella, 424
Apidae, 10 f., 20, 32
Apioceridae, 492
Apis, 53;
A. adansonii, 69;
A. domestica, 68;
A. dorsata, 69;
A. fasciata, A. ligustica, 68;
A. mellifica, 65 f.;
feet of queens and workers, 69;
ligula of, 16;
worker and hairs, 12
Apoda, 402;
A. testudo, etc., 401, 402
Apodidae, 402
Apoica pallida, nest, 83
Aporia crataegi, 322
Appetite, 491
Apterogyna, 96
Apterona, 393, 394;
A. crenulella, var. helix, 395
Apterous, beetles, 187, 263;
females, 95, 96, 140, 171, 174, 315, 392, 393, 407, 413, 430,
592—see also Workers;
Insects, 95, 96, 474, 495, 496, 505, 518, 531, 581;
males, 140, 160, 161, 172, 585, 594
Aquatic;
caterpillar, 377;
cocoons, 280;
larva, 421 f., 425, 504;
pupa, 423
Aradidae, 550
Aradus orientalis, 550
Araeocerus, 290
Araschnia levana, A. prorsa, 353
Arbelidae, 369, 396
Arceina, 373
Archiapidae, 21, 22
Archiapides, 21 f.
Arctia caja, 308;
A. villica, 410
Arctiidae, 370, 404, 408, 410
Argiva, 414
Argynnis, larvae, 354;
A. paphia, androconia, 332
Argyromoeba sinuata, 76;
A. trifasciata, 486
Arista, 442
Army-worm, 416
Aromia moschata, 188
Arthropterus, 214
Asclera caerulea, 267
Ascodipteron, 520
Asemorhoptrum lippulum, 160
Asilidae, 491
Asilus, 492;
A. crabroniformis, 441
Asparagus-beetle, 281
Aspidiotus camelliae, 592;
A. nerii, 595, development of, 596
Aspidiphorus, 246
Aspidomorpha, pupa, 283, 284
Association, of Anergates and Tetramorium, 160;
of ants and other Insects etc., 180 f.;
of Formica and Formicoxenus, 159;
of Strongylognathus and Tetramorium, 162;
of Tomognathus and Leptothorax, 161
Astata boops, 119
Astatides, 119
Asteidae, 504
Astomella lindeni, 490
Astynomus, 285
Atemeles, 225;
and ant, 182
Athericerous, 441
Atherix, 481;
A. ibis, 480
Athous rhombeus, 257
Atractocerus, 254
Atrophy of mouth and stomach, 310
Atta, 137, 164 n., 165, 502
Attacus, 373;
A. atlas, 373
Attelabides, 291
Attini, 158, 159, 165
Attitude, 381, 384, 385, 388, 412, 413, 425, 429
Atylotus fulvus, 483
Autocrates aenea, 275
Automeris, 373
Axin, 598
Azteca, 158
Azygos oviduct, 321
Badamia exclamationis, 365
Barrett, on increase of melanism, 414
Basket-worms, 393
Bat-parasites, 521, 560
Bataillon, on metamorphosis, 306
Bates, H. W., on classification of butterflies, 344;
on homoeochromatism, 351;
on Megacephala, 201
Batesian mimicry, 337, 339
Bathyscia, 221
Beak, 532
Beauregard, on vesicating Insects, 275
Beaver, Insect on, 219, 221
Becher, on mouth of Diptera, 444 n.
Bed-bug, 559;
enemy of, 558
Beddard, on animal coloration, 339 n.
Bee (i.e. honey-bee)—see Apis mellifica
Bee-louse or -tick, 520
Bees, 10 f.
Bees born of carcases, myth, 499
Bees' nest beetle, 235
Bees, stylopised, 300, 303
Beetles, 184 f.
Bellesme, on buzzing, 19
Belostomidae, 534, 565
Bembecidae, 482
Bembecides, 119 f.
Bembex, 509;
B. rostrata, 4, 120, 120 f.;
B. spinolae, 130 n.
Benchucha bug, 559
Berosus, 218
Berytidae, 548
Bibio, 475, 476, 477;
B. marci, 477
Bibionidae, 475
Birds and butterflies, 338
Biscuit-weevil, 247
Bitoma crenata, 233
Bittacomorpha, 473
Black-fly, 530
Blanchard, on flies attacking man, 517 n.
Blepharoceridae, 464
Blind beetles, 205, 221, 233
Blissus leucopterus, 548
Blister-beetles, 269
Blochmann, on founding new nests, 145
Blood-sucking, Diptera, 457;
Mosquitoes, 467
Blood-worms, 468
Blow-flies, 511
Blue-bottles, 511
Bogus Yucca-moth, 433
Boletophila luminosa, 463
Boll-worm, 416
Bombardier-beetles, 201
Bombus, 53 f.;
insect in nest of, 221;
fleas in nests of, 525;
parasite of, 94, 497;
proboscis of, 13 f., 14;
B. agrorum, 54;
B. lapidarius, 54;
B. muscorum, 57;
B. variabilis, 60
Bombyces, 367
Bombycidae, 368, 375, 406
Bombyliidae, 485
Bombylius major, 488
Bombyx mori, 375;
B. yamamai, 325
Book-worm, 247
Borboridae, 504
Borborus, 505
Borocera madagascariensis, 405
Bostrichidae, 246
Bot-flies, 514
Brachelytra, 224
Brachycera, 441, 454
Brachycerides, 291
Brachyscelides, 592, 598
Brachytarsus, 290
Braconidae, 590
Bradypus cuculliger, Tineid on, 430
Brahmaeidae, 368, 374
Brain, 320;
cephalic and thoracic, 449
Branchiae, 208, 244
Brands, 332
Brassolides, 349
Brathinus, 223
Brauer, on Dipterous larvae, 451;
on Oestridae, 514
Braula coeca, 520
Braulidae, 520
Breastbone, 459
Breeze-flies, 443, 481
Breitenbach, on proboscis of Lepidoptera, 311 n.
Breithaupt, on proboscis of bee, 15;
on deglutition of bees, 18
Brenthidae, 295
Brenthus anchorago, 297
Brephos notha, 415, 416
Brimstones, 357
Brontes planatus, 234
Brown-tail moths, 407
Bruchidae, 276
Bruchus fabae and B. lentis, 277, B. pisi, 277
Bryophila, 418
Buckell, on development of pattern, 335
Buffalo-gnats, 477
Bugong-moth, 417
Bugonia-myth, 499
Bull-dog ants, 171, 173
Bull's-horn thorn and ants, 168
Bumble bee—see Bombus
Buprestidae, 261
Buprestis attenuate, supposed larvae of, 262 n.
Burgess, on suction, 311
Burnet-moths, 390
Burrows, of Dasypoda, 27;
of Halictus, 24, 25;
of Odynerus, 74
Bursa copulatrix, 321
Burying-beetles, 221
Butterflies, 341 f.
Buzzing, 19
Byrrhidae, 242, 255
Byrrhus pilula, 242
Bythoscopidae, 578
Byturus, 241
Cadphises moorei, 391
Calandrides, 289
Calcium oxalate, 406
Calicurgus, 101;
C. hyalinatus, 102, 106
Caligo eurylochus, 350
Callidea baro, 303 n.
Callidulidae, 370, 400
Calliphora; 448;
C. erythrocephala, C. vomitoria, 511
Callirhipis dejeani, 256
Callomyia, 496
Callostoma fascipennis, 489
Caloptenus, 270;
C. italicus, 489, C. spretus, 488, 506, enemies of
Calypter, calypterate, 448
Calyptrate Muscidae, 448, 504
Camberwell Beauty, 352
Camel bot-fly, 515
Camponotides, 144
Camponotus, 145;
C. ligniperdus, 138, 145, 147;
C. pennsylvanicus, 138, 146;
C. rubripes, 131;
C. rufipes, 137
Camptosomes, 279, 281
Canephorinae, 394, 395
Cantharidae, 269 f.
Cantharides, 270
Capsidae, 561
Capsus laniarius, 539
Carabidae, 204 f.
Carabides, 206
Caraboidea, 190, 200 f.
Carcinocoris, 554
Carder-bees, 45 f., 45
Cardiocondyla, 161
Cardiophorus, 258
Carlet, on sting, 6;
on sound-organs of Cicada, 574
Carnivora, 200
Carotine, 549
Carpenter-bees, 33
Carpenter-worms, 395
Carpets, 411
Carpocapsa juliana, C. pomonella, C. splendana, C. saltitans,
428
Carpophagus, 278
Carteria lacca, 597
Carus, on paedogenesis, 461
Caryoborus, 278
Case, 281, 392, 393, 394, 417, 422, 423, 430, 431
Cassidides, 279, 283
Caste-production, 142
Castnia, 307, 309, 316, 319;
C. eudesmia, C. therapon, 372
Castniidae, 369, 371
Castor canadensis, parasite of, 219
Cataclysta lemnata, 423
Caterpillar, 322, 324, 325;
of Diptera, 474
Catopomorphus, 221
Cauda, 538, 588
Cave-beetles, 205, 221
Cebrionides, 260
Cecidipta excoecaria, 424
Cecidomyia buxi, 459;
C. destructor, 460;
C. tritici, 460
Cecidomyiidae, 455 n., 458
Cecropia, plant and ants, 158
Cedeocera, 297 n.
Cell, of wing, 317, 318;
complete and incomplete, 116 n.
Cells, formation of, by bees, 21, 22, 24, 25, 28, 33, 34, 35, 46,
48, 51, 52, 54, 56, 60;
earthen, 72, 106;
of Coelonites, 89
Celyphidae, 504
Celyphus, 505
Cemonus unicolor, 128
Cephaloidae, 275
Cephalomyia maculata, 515
Cephaloon, 275
Cephalothorax, 465;
of Stylops-larva, 302
Cephenomyia rufibarbis, 517
Cerambycidae, 278, 285
Cerambycides, 287
Ceramius, 89;
C. lusitanicus, 89
Ceranchia, 374
Cerapachys, 175 n.
Ceratina, 11, 32
Ceratocampidae, 368, 375
Ceratocombidae, 559
Ceratognathini, 194, 195
Ceratonema, 401
Ceratopogon, 469;
C. bipunctatus, C. pulicaris, C. varius, 470
Cerceris, 125;
C. arenaria, 125;
C. bupresticida, 125;
C. labiata, 125;
C. tuberculata, 126
Cercopidae, 577
Ceresa bubalus, C. taurina, 577
Cerocoma schaefferi, 275
Cerophytides, 260
Ceroplastes ceriferus, 597
Ceroplatus mastersi, 463
Cerura vinula, 383
Cervical sclerites, 472
Cetonia, in ants'-nests, 149;
C. floricola, 200
Cetoniides, 195, 199
Chaerocampa, 380; recte Choerocampa
Chaetophorous, 446
Chaetotaxy, 446
Chafers, 194 f.
Chalcosiidae, 391, 420
Chalia hockingi, 394
Chalicodoma, 32, 35;
C. muraria, 30, 35 f., 36, 254, 486;
C. parietina, C. pyrenaica, 39
Change of habit in larva, 301, 431
Chapman, Dr. T. A., on Chrysis, 3;
on classification of pupae of Lepidoptera, 367;
on Hepialus, 398;
on Metoecus paradoxus, 268;
on pupa of Lepidoptera, 327 n.
Charagia, 396
Chartergus chartarius, nest of, 82, 83
Cheilosia chrysocoma, 439
Cheimatobia brumata, 414
Cheliomyrmex, 180
Chelonariides, 242
Chelonia, 410
Chelostoma, 35
Chennium bituberculatum, 224
Chermes, 583, 586, 587;
C. abietis, 586, 587, 589
Cheshire, on proboscis of bee, 15
Chigger, 525
Child, on sense-organ, 442
China-marks, 421
Chinch-bug, 548
Chionea araneoides, 474
Chiromyzidae, 479
Chironomidae, 468, 474
Chironomus, 440, 468
Chlamydes, 279
Chlorion, 110ü
Chloropidae, 504
Chlorops, 504
Choerocampa elpenor, 380
Choerocampini, 381
Cholodkovsky on Chermes, 586
Choragus sheppardi, 290
Chorion, 322
Chrysalis, 326, 344
Chrysauginae, 423
Chrysididae, 1 f.
Chrysiridia madagascariensis, 419
Chrysis bidentata, 3;
C. ignita, 3;
C. shanghaiensis, 4
Chrysochus pretiosus, 279
Chrysocoris grandis, 303 n.
Chrysomelidae, 276, 278 f.
Chrysomelides, 279
Chrysopolomidae, 396
Chrysops, 482
Cicada, 123;
C. plebeia, 574;
C. septendecim, 569
Cicadellinae, 578
Cicadidae, 568 f.
Cicindela hybrida, 202
Cicindelidae, 201 f.
Cicinnus, 378 n.
Cilix glaucata = spinula, 401
Cimex, 560;
C. lectularius, 559
Cimicidae, 559
Cioidae, 245
Cis melliei, 245
Cistelidae, 264
Cistus salvifolius, beetle-larvae in, 282
Cithaerias, 348
Citheronia, 375
Cixiides, 576
Cixius, 575
Clambidae, 223
Clasper, 314
Classification, of ants, 144;
of bees, 20;
of butterflies, 341;
of Coleoptera, 189;
of Diptera, 454 f.;
of Hemiptera, 543;
of Hymenoptera Aculeata, 10;
of Lepidoptera, 339 f.;
of moths, 366 f.
Clavicornia, 189, 213, 265
Claviger testaceus, 224
Clavigerides, 224
Clavus, 539
Clear-wings, 386
Cleggs, 481
Cleosiris, 400
Cleptes, 2, 4
Cleridae, 253
Click-beetles, 256
Clicking butterfly, 354
Clidicus, 223
Clisiocampa neustria, 322
Clothes-moths, 430
Clouded-yellows, 357
Clypeus, 307
Clythra in ants'-nests, 149
Clythrides, 279
Cnemidotus caesus, 209
Cnethocampa processionea, 376
Coarctate larva, 271
Coccidae, 592 f.;
destroyer of, 290
Coccidula, 239
Coccinellidae, 237
Coccus cacti, 598;
C. mannifera, 597
Cochineal Insect, 598
Cochliopodidae, 402 n.
Cochlophora, 394
Cockchafer, 198
Cockroach, parasite of, 269
Cock-tail, 225
Cocoon, 46, 55, 66, 109, 122, 328, 347, 373, 376, 384, 385,
391, 403, 404, 405, 407, 419, 424, 436, 460, 462, 494;
flax-seed, 460;
of ants, 134;
aquatic, 280, 377
Cocytia durvillii, 382
Cocytiidae, 382
Codling-moth, 428
Coelioxys, 31
Coelonites, 89;
C. abbreviatus, cells of, 89
Coenomyia ferruginea, 480
Coenomyiidae, 479
Coenonympha, 348
Colaenis, 351
Coleophora, 431
Coleoptera, 184 f.
Colletes, 22;
C. daviesanus, 30
Colobopsis, 138
Colon, 320
Colorado beetle, 278
Colour, corresponding with locale, 201;
and surroundings, 337;
of larva and habits, 336;
of Sphingidae larvae, 381;
physiology of larval, 413;
of caterpillars and sex, 325;
development of, in Hemiptera, 542;
of eyes, 440
Coluocera formicaria, 240
Colydiidae, 233, 234
Colydium, 233
Comb, 63, 64, 65, 78, 79
Combs and brushes, 134
Compound pupa, 452
Composmyia, 512
Comstock, on nervures, 317 n.
Conchylidae, 427
Connexivum, 538
Conopidae, 497, 504
Conorhinus sanguisuga, 559
Copiopteryx, 373
Copium clavicorne, 550
Copius, 547
Copper butterflies, 356
Coprides, 195 f.
Coprini, 195 n.
Copris hispanus, 197
Copulatory pouch, 320, 321
Cordyluridae, 504
Coreidae, 546
Corethra, 467
Corium, 539
Corixa, 567
Corixidae, 567
Corn-leaves, larva on, 281
Coronidia, 419
Corotoca, 227
Corticaria, 240
Corylophidae, 228
Corynetides, 253
Coscinocera hercules, 372
Cossidae, 369, 395
Cossonides, 294
Cossus, 309;
C. ligniperda, 319
Costal nervure, 318
Cotton-stainer bug, 548
Cotton-worm, 416
Courtship, 494;
of Hepialus, 398 f.
Coxa, 307
Crabro, 129;
C. cephalotes, 129;
C. stirpicola, 130 n.
Crabronides, 128 f.
Crambidae, 425
Crane-flies, 471
Cratoparis, 290
Cremaster, 327, 328, 344, 426
Cremastochilini, 200
Cremastochilus, 200
Cremastogaster, 165, 213;
C. tricolor, 165
Crepitation, 213, 214
Criocerides, 279, 280
Crioceris asparagi, 281;
C. merdigera, 281
Crossocerus, 130;
C. wesmaeli, 130
Cryptocephalides, 279
Cryptocephalus, 282
Cryptocerata, 544, 562 f.
Cryptocerini, 132, 134, 158, 159, 169
Cryptocerus, 138;
C. atratus, 170
Cryptophagidae, 235, 237
Cryptophagus dentatus, 235
Cryptostomes, 279, 282
Cteniza ariana, destroyer of, 490
Ctenophora, 475
Ctenostylidae, 517
Ctenuchinae, 409
Cuckoo-bees, 22
Cuckoo-spit, 577
Cucujidae, 232, 234
Cucujos, 258
Cuculinae, 20
Culex pipiens, 466
Culicidae, 466 f.
Cultelli, 443
Cuneus, 539, 540
Cupesidae, 234
Curculionidae, 290
Curtice, on Hypoderma, 516
Curupira, 465
Cut-worms, 415
Cyathoceridae, 243
Cybister laterimarginalis or roeseli, 210;
C. tripunctatus, 211
Cybocephalus, 232
Cyclica, 279, 282
Cyclorrhapha, 454;
C. Aschiza, 455, 494 f.;
C. Schizophora, 456, 503 f.
Cylidrus, 253
Cylindrotomina, 474
Cymatophoridae, 368, 386
Cymbidae, 410
Cynomyia mortuorum, 510
Cyphagogus segnipes, 296
Cyphanta, 368 n.
Cyphonia clavata, 576
Cyphonidae, 255
Cyrtidae, 489
Cyrtocorides, 545
Cyrtocoris monstrosus, 546
Dacnides, 237
Dactylopius citri, 595
Daddy-long-legs, 471
Dakruma coccidivora, 424
Dalla Torre, Catalogue of Hymenoptera, 21
Danaides, 344, 347
Danaioid Heliconiidae, 346
Danais archippus, or plexippus, 345
Dances, 351, 464, 493, 554
Darwin, C., on Pelobius, 208
Darwin, F., on proboscis of Lepidoptera, 311 n.
Dascillidae, 243, 255
Dascillus cervinus, 255
Dasychira pudibunda, 408;
D. rossii, 407
Dasygastres, 20, 35 f.
Dasypoda hirtipes, 27
Welcome to our website – the ideal destination for book lovers and
knowledge seekers. With a mission to inspire endlessly, we offer a
vast collection of books, ranging from classic literary works to
specialized publications, self-development books, and children's
literature. Each book is a new journey of discovery, expanding
knowledge and enriching the soul of the reade
Our website is not just a platform for buying books, but a bridge
connecting readers to the timeless values of culture and wisdom. With
an elegant, user-friendly interface and an intelligent search system,
we are committed to providing a quick and convenient shopping
experience. Additionally, our special promotions and home delivery
services ensure that you save time and fully enjoy the joy of reading.
Let us accompany you on the journey of exploring knowledge and
personal growth!
textbookfull.com