07-11-2008, 08:01 PM
Hermit Dave
datatable from xml doc and massive time differences on desktop / server

Hello All,

I have an XML document that can contain a massive amount of data, with no
fixed child node sequence.
We have some 9,000-odd fields, and any of them can be part of the XML.

The way I build the data table is to process one record at a time. I
keep a list of columns in a hash table (with ordinal position) and in a string
collection (for retrieval in the correct order). For each field, if the
column does not exist, I add the column; then I set the value in the data
table at that ordinal position.
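
For anyone reading along, the approach described above might look roughly like the sketch below. It is only an illustration, not the original code: the element name passed as `recordName`, the class and method names, and the assumption that each record holds flat text-only fields are all hypothetical. The string collection is left out because `table.Columns` already keeps the first-seen order.

```csharp
using System;
using System.Collections;
using System.Data;
using System.IO;
using System.Xml;

public class DynamicTableSketch
{
    // recordName is a hypothetical element name; the real schema is not shown in the thread.
    public static DataTable Build(TextReader input, string recordName)
    {
        DataTable table = new DataTable("records");
        Hashtable ordinals = new Hashtable();   // field name -> column ordinal
        XmlTextReader reader = new XmlTextReader(input);
        DataRow row = null;

        while (reader.Read())
        {
            if (reader.NodeType == XmlNodeType.Element && reader.Name == recordName)
            {
                row = table.NewRow();           // start of a record
                if (reader.IsEmptyElement) { table.Rows.Add(row); row = null; }
            }
            else if (reader.NodeType == XmlNodeType.Element && row != null)
            {
                string name = reader.Name;
                object ordinal = ordinals[name];    // one hash lookup per field
                if (ordinal == null)
                {
                    // First time this field is seen: add the column and remember its ordinal.
                    ordinal = table.Columns.Add(name, typeof(string)).Ordinal;
                    ordinals[name] = ordinal;
                }
                // ReadString returns the field text and leaves the reader on the end tag.
                row[(int)ordinal] = reader.ReadString();
            }
            else if (reader.NodeType == XmlNodeType.EndElement
                     && reader.Name == recordName && row != null)
            {
                table.Rows.Add(row);            // end of a record
                row = null;
            }
        }
        return table;
    }

    public static void Main()
    {
        string xml = "<data><record><b>2</b><a>1</a></record>"
                   + "<record><c>3</c><a>4</a></record></data>";
        DataTable table = Build(new StringReader(xml), "record");
        Console.WriteLine("{0} rows, {1} columns", table.Rows.Count, table.Columns.Count);
    }
}
```

Fields that a record does not contain are simply left as DBNull in that row.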

For an XML doc containing 85,000-odd records with, say, 5 child fields each, on my
desktop I can process the whole thing and display it in a paged list in 4 mins
40 secs; on the server, however, it takes around 12 mins.
Same input file, same code, same binaries.

The desktop, which performs faster, is a Core 2 Duo 2.4 GHz with 2 GB of RAM
running XP.
The server has 4 x 2.6 GHz dual-core Opterons with 16 GB of RAM and runs
Windows Server 2003 Enterprise Edition.

What do you guys think of it? TIA

Hermit

PS: It's coded in C# in VS.NET 2003 using standard framework classes
(DataTable, XmlTextReader, XmlDocument, etc.)

07-12-2008, 11:59 AM
Pavel Minaev
Re: datatable from xml doc and massive time differences on desktop / server

Where is the source XML read from on your desktop, and on your server?
It could well be an I/O bottleneck, not a problem with your code as
such. Perhaps the server reads it from an SMB share, or something like
that?
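
One way to check this is to time a pass that only reads and parses the file, without building any table, on both machines. The sketch below is a minimal illustration under that assumption; the class name and the command-line argument are hypothetical, and `DateTime.UtcNow` is used for timing so it should also compile on older framework versions.

```csharp
using System;
using System.IO;
using System.Xml;

public class ParseOnlyTimer
{
    // Reads every node and discards it, so the elapsed time covers only I/O and parsing.
    public static int CountElements(TextReader input)
    {
        XmlTextReader reader = new XmlTextReader(input);
        int elements = 0;
        while (reader.Read())
        {
            if (reader.NodeType == XmlNodeType.Element) elements++;
        }
        return elements;
    }

    public static void Main(string[] args)
    {
        // args[0] is the path to the same XML file used on both machines.
        if (args.Length == 0)
        {
            Console.WriteLine("usage: ParseOnlyTimer <file.xml>");
            return;
        }
        DateTime start = DateTime.UtcNow;
        int count;
        using (StreamReader file = new StreamReader(args[0]))
        {
            count = CountElements(file);
        }
        TimeSpan elapsed = DateTime.UtcNow - start;
        Console.WriteLine("{0} elements read in {1}", count, elapsed);
    }
}
```

If the parse-only time is similar on the desktop and the server, the difference is more likely in the table-building code than in the storage.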

07-14-2008, 07:23 PM
Hermit Dave
Re: datatable from xml doc and massive time differences on desktop / server

Pavel,

The desktop has a standard IDE / SATA drive.
The server uses a SAN.

I don't think it's an I/O issue: my team leader tried the brute-force approach
of assuming a column exists for a given node (catching any exception and
handling it correctly), and the server timing dropped to about two and a half
minutes.
I guess it was something else. Maybe something in the code was being
optimised a lot better for the Intel architecture.
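
The "assume the column exists" idea could be sketched as below. This is a guess at the shape of the change, since the actual code is not shown in the thread; the class and method names are hypothetical. It relies on the `DataRow` indexer throwing `ArgumentException` when the named column is not in the table.

```csharp
using System;
using System.Data;

public class OptimisticColumnSketch
{
    // Sets a field by name, adding the column only when the first attempt fails.
    public static void SetField(DataTable table, DataRow row, string name, string value)
    {
        try
        {
            row[name] = value;      // common case: the column already exists
        }
        catch (ArgumentException)
        {
            // Unknown column name: add it once, then retry the assignment.
            table.Columns.Add(name, typeof(string));
            row[name] = value;
        }
    }

    public static void Main()
    {
        DataTable table = new DataTable("records");
        DataRow row = table.NewRow();
        SetField(table, row, "a", "1");   // adds column "a"
        SetField(table, row, "a", "2");   // column exists, no exception
        table.Rows.Add(row);
        Console.WriteLine("{0} column(s), a = {1}", table.Columns.Count, row["a"]);
    }
}
```

With only a handful of distinct fields in the file, the exception path runs a few times in total, so almost every assignment takes the cheap path.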

Regards,

Hermit
