Friday, December 15, 2006

Use Numbers

To argue, try to use statements with numbers, like
  • "1000 sites * 20 services * 200 VOs * 10 roles/groups = 40M metadata"
  • "100k CPUs * 10 jobs per day * 10 queries per job = 10M queries per day, 115 queries per second."
  • Publishing one service adds 138 RDF statements to the triple store, which consumes up about 25 KB of memory. Then each triple uses 181 bytes. Each metadata has at least 3 triples: (entity, hasMetadata, metadata), (metadata, hasMetadataType, type), (metadata, hasMetadataValue, value). Thus 4M metadata uses 181 * 3 * 4M = 2.1GB.
And pay attention to detail.

Wednesday, December 06, 2006

Tuesday, December 05, 2006

Create A User Community to Improve Our Open Source Software

Why?
  • Create impact. Open source software's value relies on its usage.
  • Get feedback, which could be good ideas for the future work.
How?
  • Create a well organized website, easy downloads, a bugzilla system, etc. Twiki and Sourceforge are very helpful in this sense.
  • Well answer questions by email.
  • Create a user list, which includes the project name, url, contacts, etc.
BTW, check out OpenCola, which applies the open source idea in making cola.

Monday, December 04, 2006

My New Laptop

Today I got a new laptop, a SONY VAIO, which has
              • Intel® Core™ 2 Duo T7200 Processor 2.0GHz
              • 2GB, 100GB HDD & DVD/RW Super Multi DL Drive
              • 13.3" WXGA TFT Display, Bluetooth & FREE T-Mobile WAN Card
              • ... ...
              • And a fingerprint reader! So that I am able to log in Windows by using my fingerprint. Cool!

Sunday, December 03, 2006

学什么专业?挣多少钱?——美国各级毕业生2006薪水调查

摘自八阕

<CopiedContent>

八阕】美国大学与雇主协会,每年按4个季度分别发布新毕业的大学生、 硕士和博士研究生的薪水调查报告,这报告共涉及80个学士专业,40个硕士专业,23个博士专业。冬、春、夏三季发布的数据是各季度的数据,秋季版发布的 是去年8月31日到今年9月1日为止的整个学年数据。以下列出的是该协会最新公布的美国博士和硕士毕业生2006年秋季薪水调查报告。

美国博士毕业生2006年薪水最高的几种专业:

1、商业管理(Business Administration/Management):$81,438

2、电气/电子与通信工程(Electrical/Electronics & Communications Engineering): $81,297

3、冶金工程(含陶瓷科学与工程)(Metallurgical Engineering ):$80,667

4、计算机科学(Computer Sciences):$76,630

5、化学工程(Chemical Engineering):$75,659

6、计算机工程(Computer Engineering):$74,750

7、工业/制造工程(Industrial/Manufacturing Engineering):$73,292

8、物理(Physics):$72,357

如果这些工程师没有升迁到高层管理,做到退休一般年薪不会超过$150,000

美国博士毕业生2006年薪水最低的几种专业:

1、心理学(Psychology):$49,374

2、人文科学(Humanities):$48,938

3、社会科学(Social Sciences):$48,487

4、生物/生命科学(Biological Sciences/Life Sciences):$43,916

5、英语语言与文学(English Language & Literature/Letters ):$41,405

美国硕士毕业生2006年薪水最高的10种专业:

01、计算机科学(Computer Sciences):$71,165

02、石油工程(Petroleum Engineering):$68,833

03、电气/电子与通信工程(Electrical/Electronics & Communications Engineering):$66,687

04、计算机工程(Computer Engineering):$66,545

05、地质及相关科学(Geological & Related Sciences):$64,111

06、材料科学(Materials Sciences):$63,500

07、航天/航空/宇航工程(Aerospace/Aeronautical/Astronautical Engineering):$62,811

08、工业/制造工程(Industrial/Manufacturing Engineering ):$61,273

09、机械工程(Mechanical Engineering):$61,234

10、化学工程(Chemical Engineering):$59,008

如果这些工程师没有升迁到高层管理,做到退休一般年薪不会超过$120,000

2004年美国律师的年薪平均为:$94,930

最低年薪:$64,620,最高年薪: $143,620

在公司和企业管理层谋职(Management of companies and enterprises):$126,250

在联邦政府谋职(Federal Government):$108,090

在法律机构谋职(Legal services):$99,580

在地方政府谋职(Local government):$73,410

在州政府谋职(State government):$70,280

2004年美国各科医生的年薪:

麻醉科(Anesthesiology):$306,964

外科、普通外科(Surgery, general):$255,438

妇产科(Obstetrics/Gynecology:$233,061

内科(Internal medicine):$155,530

小儿/青少年科(Pediatrics/Adolescent medicine):$152,690

精神科(Psychiatry):$163,144

家庭全科(Family Practice):$150,267

联邦人口普查局2006年11月还发表了一项调查数据“2004年美国学历与平均年薪”:

高中以下学历平均年薪:$19,169

高中毕业平均年薪:  $28,645

大学毕业平均年薪:  $51,554

硕士以上学位平均年薪:$78,093

从以上各项的调查报告看来,华人根深蒂固的教育观念“学好数理化,走遍天下都不怕”,似乎真的放之四海而皆准了。并且,华人父母努力说服自己的子女当律师和医生也是有根有据的。当然,学历越高收入越多的说法更是真理。

</CopiedContent>

我的评论:
  • 选择专业前先要做好各个专业的benchmark, 什么专业工资最高。看来benchmark真是无时无地不重要。
  • 计算机科学博士起薪只比计算机科学硕士高7.7%。虽然理论上博士比硕士有更好的成长空间,究竟读不读博士还是要慎重考虑。
  • 医生收入压倒性的高,远高于其他专业,甚至包括律师。还是学医收入高。
  • 读书有用啊。读书越多,收入越高。
BTW, I read this from Ian Foster's blog:

"Indeed, it was outsiders—those with expertise at the periphery of a problem's field—who were most likely to find answers and do so quickly."

Friday, December 01, 2006

Java I/O

This presentation gives a good overview of Java I/O, from which I have learned:
  • FileInputStream/FileOutputStream: for sequential I/O and simple to buffer
  • RandomAccessFile: for full random-access
  • FileChannel: adds support for NIO byte buffers
  • MappedByteBuffer: for memory-mapped I/O
  • Buffering: for instance, use BufferedOutputStream for buffering FileOutputstream
  • Forcing I/O, e.g. flush() or FileChannel.force(), equals to the "fsync" system call
  • No direct I/O in Java