MPDL IT Infrastructure/Planning/PubMan

MPDL see also MPDL IT Infrastructure/Planning =eDoc=
 * No of Institutes: 95
 * Publications: 273763 (all statusses)
 * Avg/Institute: 2881
 * Only MPG publications


 * No of registered users: 591
 * Avg/Institute: 6

=Current state=
 * Publications: 15540
 * No of institutes: 4 (incl. NIMS)
 * Avg/Institute: 3885
 * External publications also included
 * No of registered users: 869
 * Avg/Institute: 200 (researchers also)
 * Avgerage logged-in-users/hour: 17 .. ca 4/Institute

Storage
5MB / Publication Total estimate for 20% growth per year: 3T


 * PgSql: 7.2G (can not estimate, as future core services will be based on Lucene), mostly statistics data
 * Avg: 0,06 ;B/ Publication data + publication data indexes
 * Fedora: ca 54G
 * Avg: 3.56 MB/Publication
 * Lucene: 1.2 G
 * Avg: 0.08MB/Publication (not included Administrative indexes, duplicated at minimum)
 * Statistics: 5GB (separately to be calculated, will have new storage concept in next core service)

Storage estimate

 * Expected: All publications from eDoc 273763 + growth 20% in the first year + 20% growth every subsequent year
 * Initial storage required:
 * ca: 1.3 TB
 * growth 20% in the first year => ca. 1.5 TB
 * growth 20% in the second year => ca. 1.9 TB
 * growth 20% in the third year => ca. 2.7 TB

Load average

 * srv02, srv01 - biggest load during recache, reindex, migration - otherwise below 0.3
 * srv03 - below 3 (but above 2) - could be improved or re-shuffling of cone/postgres database biggest load most probably during imports

Memory consumption

 * srv01 - 32GB - swapping heavily (maybe related to coreservice 1.2) - to be investigated, was not happening with core 1.1
 * srv02 - 32GB - average used 13GB (last 3 months)
 * srv03 - 16GB - average used 13GB (last 3 months) - could use a bit more memory with current PubMan/Cone/Postgres combination