Pages: [1] 2
Saenger
 
Tester - Translator
BAM!ID: 5
Joined: 2006-01-10
Posts: 1735
Credits: 226,945,954
World-rank: 6,528

2007-06-11 17:13:46

SciLINC

Background info here available

Applications for Win and Lin. (Though no WUs are being send atm)

From Background info:
Missouri Botanical Garden proposes Scientific Literature Indexing on Networked Computers (SciLINC), a public-resource computing application that will automatically index large amounts of digitized scientific literature, ultimately providing users with an integrated Web portal, available at www.botanicus.org, for the discovery of information about plants.
Grüße vom Sänger
Trog Dog
 
Volunteer tester
BAM!ID: 206
Joined: 2006-05-11
Posts: 119
Credits: 66,172,318
World-rank: 14,993

2007-06-11 21:50:21

Cheers Saenger
AlphaLaser
BAM!ID: 4118
Joined: 2006-08-16
Posts: 98
Credits: 1,209,990,414
World-rank: 1,956

2007-06-14 06:50:03


... analyze text from digitized botanical literature ...


Quite a unique and original project, I might say.
Guest

2007-06-14 12:54:37
last modified: 2007-06-14 12:55:04

Sever Page shows 57,882 wu's to send, but BOINC gets "No work from Project"

http://www.scilinc.org/SciLINC/server_status.php
Saenger
 
Tester - Translator
BAM!ID: 5
Joined: 2006-01-10
Posts: 1735
Credits: 226,945,954
World-rank: 6,528

2007-06-14 13:18:45

Sever Page shows 57,882 wu's to send, but BOINC gets "No work from Project"

http://www.scilinc.org/SciLINC/server_status.php

It's this way since I signed up on Monday, was probably never any real WU there.

As there are no message boards, nobody will know unless someone a) writes a mail and gets an answer or b) makes fora available.

They have a blog, where you can at least ask questions (and it was already asked). I'll post a link to this thread in there, perhaps someone will come here.
Grüße vom Sänger
Krunchin-Keith [USA]
BAM!ID: 27912
Joined: 2007-06-02
Posts: 86
Credits: 0
World-rank: 0

2007-06-14 22:41:39

Sever Page shows 57,882 wu's to send, but BOINC gets "No work from Project"


I had 153 before for some reason they stopped being sent. They all ran 2 to 5 seconds. Windows OK, Linux had all errors. My credit totals 0.68 for that.

That number has been stuck at 57,882 ready to send since 6/7/2007.

I sent an email but got no response.

Thats all I know.
chrisfreeland
BAM!ID: 28733
Joined: 2007-06-15
Posts: 2
Credits: 0
World-rank: 0

2007-06-15 15:32:07

SciLINC will be sending new wu by Monday, June 18. Apologies for the delay!
Chris Freeland
Saenger
 
Tester - Translator
BAM!ID: 5
Joined: 2006-01-10
Posts: 1735
Credits: 226,945,954
World-rank: 6,528

2007-06-15 15:57:08
last modified: 2007-06-15 15:58:15

SciLINC will be sending new wu by Monday, June 18. Apologies for the delay!
Chris Freeland

No apologies required, but THX anyway.

And THX for the WU (btw: you say "new wu", that's singular, does this mean there will be only one available? )
Grüße vom Sänger
chrisfreeland
BAM!ID: 28733
Joined: 2007-06-15
Posts: 2
Credits: 0
World-rank: 0

2007-06-15 17:41:28

And THX for the WU (btw: you say "new wu", that's singular, does this mean there will be only one available? )


Yes, there will just be the one and you'll all have to fight over it! No, eventually there will be more than 650,000 WUs. In short, we're sending down OCR text from scientific literature and finding strings that match characteristics of taxonomic names. The processing is not intense, so WUs fly through quickly.
Krunchin-Keith [USA]
BAM!ID: 27912
Joined: 2007-06-02
Posts: 86
Credits: 0
World-rank: 0

2007-06-15 21:34:24

I was going to report I tried to contact another person about SciLINC and got a response, but I see he, Chris Freeland, Has already posted here.

What he didn't mention was, the other guy, Ron Parker, went on vacation, which is sort of what I guessed. Figures things go foobar when no one is watching the store.

Thanks Chris for the email.
zombie67
BAM!ID: 1560
Joined: 2006-06-04
Posts: 606
Credits: 183,414,978,579
World-rank: 43

2007-06-15 22:14:44

I just got a bunch of work. But then the server went down, and many of those tasks are stuck in downloading and uploading status. Each one took about a second to run.
Reno, NV
Team: SETI.USA
zombie67
BAM!ID: 1560
Joined: 2006-06-04
Posts: 606
Credits: 183,414,978,579
World-rank: 43

2007-06-16 00:30:48

It occurs to me that,

1) all the work takes only about a second to run

and

2) the leadtime is only 1 day, which means all results downloaded will go into EDF almost immediately, and will return results immediately.

So...their site will be HAMMERED with all the upload/download requests. I wonder if their site is up for that kind of traffic? Perhaps that is why it went down almost immediately after offering units.
Reno, NV
Team: SETI.USA
zombie67
BAM!ID: 1560
Joined: 2006-06-04
Posts: 606
Credits: 183,414,978,579
World-rank: 43

2007-06-16 01:01:20

I had 153 before for some reason they stopped being sent. They all ran 2 to 5 seconds. Windows OK, Linux had all errors. My credit totals 0.68 for that.



Yep. Same for me. Windows works fine, Linux is all errors.

Upon completion, there are around 50 files to upload per result!
Reno, NV
Team: SETI.USA
AlphaLaser
BAM!ID: 4118
Joined: 2006-08-16
Posts: 98
Credits: 1,209,990,414
World-rank: 1,956

2007-06-16 03:24:18

Can anyone else attest to this project possibly causing BOINC daemon memory/CPU usage to balloon? Saying this because of the combination of just receiving many SciLINC WU's and now over a thousand files located in the project directory. I posted to BOINC forums.
Barbud [USA]
 
BAM!ID: 5198
Joined: 2006-09-02
Posts: 83
Credits: 1,445,882
World-rank: 175,020

2007-06-16 03:52:25

Personally I had three computers hooked up to SciLINC and I only noticed something funny when I was checking the weather. The PC acted sluggish and the download was slow. This is on my Dual core machine with a 5MB cable modem, I should not feel anything slow.

After fianlly pausing BOINC I noticed that Boinc was still taking 97% cpu power at all tiomes. It took me several Boinc Shutdowns to reset the scilinc project only after I locked my firewall. Once I reset, I detached the project. Finally my pc is working right.

I do not know about you but I will wait on this project to get a forum and find out what the &$%% is going on. I still can't clear my laptop, I had to pull the battery.
zombie67
BAM!ID: 1560
Joined: 2006-06-04
Posts: 606
Credits: 183,414,978,579
World-rank: 43

2007-06-16 06:30:38

Holy crap! It completely locked up 3 dual core machines. No amount of the usual remedies would solve the problem. Something got completely so FUBAR, I had to uninstall BOINC, trash the remaining BOINC folder, reboot, and reinstall BOINC. What a cluster (!*&!

I had several (of the same) projects running on each of those machines.

So after the surgery, I attached only one of each project on each machine. Yep. It's this one.

So...I still have copies of the BOINC folders from just before I NUKED them.

Here's my problem. I has one Seasonal Attribution Project job on each machine, only about *3 days* away from completion. AAARGH! Any way to recover them? Little help?
Reno, NV
Team: SETI.USA
Shai Hulud
 
BAM!ID: 7901
Joined: 2006-10-06
Posts: 49
Credits: 1,472,797
World-rank: 173,279

2007-06-16 08:08:27
last modified: 2007-06-16 08:10:25

SciLINC locked up my computer also. It downloaded 50 wu's, each taking 6 seconds to calculate on my P3 1GHz.

But in total its been busy/locked up for 8 hours!

Yeah, FUBAR describes it quite well!

I suspended all other projects, set SciLINC to no new work. Now 8 hours later, everythings back to normal again!

My main host downloaded 50 wu's too. But I wont get there within the next 8 hours... *fingers crossed* It has 2 SAPs also (35% and 5%)...
AlphaLaser
BAM!ID: 4118
Joined: 2006-08-16
Posts: 98
Credits: 1,209,990,414
World-rank: 1,956

2007-06-16 08:46:46
last modified: 2007-06-16 08:48:00

So far I'm attributing this to the huge number of files each WU uses/generates. My project directory reached a peak of nearly 10,000 files, and BOINC had to keep track of every single one of them. The issue is that while BOINC is still doing work, it'll be next to impossible to communicate with it via RPC. However, looking at stdoutdae.txt showed that it was still transferring files. With at least a hundred files per WU, its obviously going to take some time for BOINC to get them transferred out, despite each one being only a kilobyte or so in size. So I'm going to let it cool down for the next several hours with every other project suspended and SciLINC set to no new work. I had to stop BOINC and manually edit client_state.xml to do this. Hopefully SciLINC server will be up long enough.

Lesson learned is:

Do NOT attach to SciLINC!

Otherwise it will take over BOINC and you will be forced to either uninstall it or it will steal time from you and your other projects.

To admins: The solution may be to ZIP up files before transferring them. I believe its just the vast number of file transfers BOINC is trying to handle thats grinding it to a halt. If everything is in a zip BOINC will have far less to deal with and reduces server overhead. But please test before handing out the WUs to everyone, this is a very major and easy to catch problem!
Cori
 
BAM!ID: 2
Joined: 2006-01-09
Posts: 1351
Credits: 69,052,230
World-rank: 14,581

2007-06-16 10:46:24
last modified: 2007-06-16 10:46:56

Ok, I see I'm not the only one who had main problems with the up-/dowloading queue. Also the BOINCmanager used over 40MB RAM when it tried to handle this zillions of files.

I finally managed to "kill" BOINC via Windows TaskManager after 30 minutes of upload on dsl for the first 5 or 6 WUs. After restarting BOINC I suspended network and detached. The upload of over 20 WUs of this project would have taken hours I guess. And I won't even imagine how that hurts dial-upper's.

@SciLINC admins: please reduce the number of files for down-/uploading significantly and/or use ZIP as AlphaLaser suggested below!
AND please open your message board for reporting such bugs!
Greetings from Cori
ebahapo
 
BAM!ID: 239
Joined: 2006-05-12
Posts: 662
Credits: 18,928,018
World-rank: 35,256

2007-06-16 14:25:41

FWIW, I managed to keep the impact of returning SciLINC results to a minimum by increasing the number of total file transfers from 8 to 16 as well as the number of file transfers per project from 2 to the same 16 (using BoincView). This way, the BOINC client flushes the queue quickly and does not page fault like mad as before and it doesn't hog the system anymore.

HTH
Shann
Volunteer tester
BAM!ID: 82
Joined: 2006-05-10
Posts: 85
Credits: 10,091,596
World-rank: 54,452

2007-06-16 17:31:13

FWIW, I managed to keep the impact of returning SciLINC results to a minimum by increasing the number of total file transfers from 8 to 16 as well as the number of file transfers per project from 2 to the same 16 (using BoincView). This way, the BOINC client flushes the queue quickly and does not page fault like mad as before and it doesn't hog the system anymore.

HTH


How do you do that ?
Krunchin-Keith [USA]
BAM!ID: 27912
Joined: 2007-06-02
Posts: 86
Credits: 0
World-rank: 0

2007-06-16 18:32:31

I left in run to see how bad it could get.

I found that just running the boinc.exe was better. If I had BOINC manager running it was un-responsive and BOINCview would not update this host. With just the core client BOINCview will update sometimes.

I had done the same thing, but set to max of 8 transfers and only have SciLINC running, all others suspended. I tried 16, but 8 seems better for me. I finally got through all the work. Something like 163 tasks and now that the downloads have stopped and all work done its just uploads left. I was up to something like 5,260 max transfers at one point, and its been running for at least 4 hours since I modified it. IT ran all night in slow motion mode. This is one host.

I want to see how long it takes, from now to clear the last 5,000 uploads.

So this message time represents the 5,000 mark.
ebahapo
 
BAM!ID: 239
Joined: 2006-05-12
Posts: 662
Credits: 18,928,018
World-rank: 35,256

2007-06-16 21:32:52

How do you do that ?

Using the "Open BOINC core-client configuration" button in the "Hosts" toolbar of BoincView (http://boincview.amanheis.de/).

With 16/16 the system is quite responsive and as soon as it gets ahold of the SciLINC server, it depletes the results queue in seconds.

HTH
AlphaLaser
BAM!ID: 4118
Joined: 2006-08-16
Posts: 98
Credits: 1,209,990,414
World-rank: 1,956

2007-06-17 01:37:06

I noted on the thread I posted to the BOINC forums that it took about 16-17 hours for the host to fully recover from the situation. I'm guessing the host pulled between 100-150 WUs, I'd check the results page on SciLINC but it appears down at the moment. BOINCView had trouble contacting the host so I just changed settings by editing files. I pushed my settings to 24/24. I might try an experiment of re-enabling SciLINC with these new transfer settings and monitor closely to see if it is good enough to reduce the backlog that occurred in the first place.
Pages: [1] 2

Index :: New projects :: New project: SciLINC
Reason: