香蕉影院在线,亚洲女人视频,欧美亚洲综合网

高手戲玩c++

Posted on 2007-02-26 20:58 morphis 閱讀(268) 評論(0) 編輯收藏所屬分類: 1. C/Cpp

toupper,tolower
地球人都知道 C++ 的 string 沒有 toupper ，好在這不是個大問題，因為我們有 STL 算法：

string s("heLLo");
transform(s.begin(), s.end(), s.begin(), ::toupper);
cout << s << endl;
transform(s.begin(), s.end(), s.begin(), ::tolower);
cout << s << endl;

當然，我知道很多人希望的是 s.to_upper() ，但是對于一個這么通用的 basic_string 來說，的確沒辦法把這些專有的方法放進來。如果你用 boost stringalgo ，那當然不在話下，你也就不需要讀這篇文章了。

------------------------------------------------------------------------
trim
我們還知道 string 沒有 trim ，不過自力更生也不困難，比 toupper 來的還要簡單：

??? string s("?? hello?? ");
??? s.erase(0, s.find_first_not_of(" \n"));
??? cout << s << endl;
??? s.erase(s.find_last_not_of(' ') + 1);
??? cout << s << endl;

注意由于 find_first_not_of 和 find_last_not_of 都可以接受字符串，這個時候它們尋找該字符串中所有字符的 absence ，所以你可以一次 trim 掉多種字符。

-----------------------------------------------------------------------
erase
string 本身的 erase 還是不錯的，但是只能 erase 連續字符，如果要拿掉一個字符串里面所有的某個字符呢？用 STL 的 erase + remove_if 就可以了，注意光 remove_if 是不行的。

??? string s("?? hello, world. say bye?? ");
??? s.erase(remove_if(s.begin(),s.end(),
??????? bind2nd(equal_to<char>(), ' ')),
??? s.end());

上面的這段會拿掉所有的空格，于是得到 hello,world.saybye。

-----------------------------------------------------------------------
replace
string 本身提供了 replace ，不過并不是面向字符串的，譬如我們最常用的把一個 substr 換成另一個 substr 的操作，就要做一點小組合：

??? string s("hello, world");
??? string sub("ello, ");
??? s.replace(s.find(sub), sub.size(), "appy ");
??? cout << s << endl;

輸出為 happy world。注意原來的那個 substr 和替換的 substr 并不一定要一樣長。

-----------------------------------------------------------------------
startwith, endwith
這兩個可真常用，不過如果你仔細看看 string 的接口，就會發現其實沒必要專門提供這兩個方法，已經有的接口可以干得很好：

??? string s("hello, world");
??? string head("hello");
??? string tail("ld");
??? bool startwith = s.compare(0, head.size(), head) == 0;
??? cout << boolalpha << startwith << endl;
??? bool endwith = s.compare(s.size() - tail.size(), tail.size(), tail) == 0;
??? cout << boolalpha << endwith << endl;

當然了，沒有 s.startwith("hello") 這樣方便。

------------------------------------------------------------------------
toint, todouble, tobool...
這也是老生常談了，無論是 C 的方法還是 C++ 的方法都可以，各有特色：

??? string s("123");
??? int i = atoi(s.c_str());
??? cout << i << endl;
???
??? int ii;
??? stringstream(s) >> ii;
??? cout << ii << endl;
???
??? string sd("12.3");
??? double d = atof(sd.c_str());
??? cout << d << endl;
???
??? double dd;
??? stringstream(sd) >> dd;
??? cout << dd << endl;
???
??? string sb("true");
??? bool b;
??? stringstream(sb) >> boolalpha >> b;
??? cout << boolalpha << b << endl;

C 的方法很簡潔，而且賦值與轉換在一句里面完成，而 C++ 的方法很通用。

------------------------------------------------------------------------
split
這可是件麻煩事，我們最希望的是這樣一個接口： s.split(vect, ',') 。用 STL 算法來做有一定難度，我們可以從簡單的開始，如果分隔符是空格、tab 和回車之類，那么這樣就夠了：

??? string s("hello world, bye.");
??? vector<string> vect;
??? vect.assign(
??????? istream_iterator<string>(stringstream(s)),
??????? istream_iterator<string>()
??? );

不過要注意，如果 s 很大，那么會有效率上的隱憂，因為 stringstream 會 copy 一份 string 給自己用。

------------------------------------------------------------------------
concat
把一個裝有 string 的容器里面所有的 string 連接起來，怎么做？希望你不要說是 hand code 循環，這樣做不是更好？

??? vector<string> vect;
??? vect.push_back("hello");
??? vect.push_back(", ");
??? vect.push_back("world");
???
??? cout << accumulate(vect.begin(), vect.end(), string(""));

不過在效率上比較有優化余地。

-------------------------------------------------------------------------

reverse
其實我比較懷疑有什么人需要真的去 reverse 一個 string ，不過做這件事情的確是很容易：

? std::reverse(s.begin(), s.end());

上面是原地反轉的方法，如果需要反轉到別的 string 里面，一樣簡單：

? s1.assign(s.rbegin(), s.rend());

效率也相當理想。

-------------------------------------------------------------------------

解析文件擴展名
字數多點的寫法：

??? std::string filename("hello.exe");

??? std::string::size_type pos = filename.rfind('.');
??? std::string ext = filename.substr(pos == std::string::npos ? filename.length() : pos + 1);

不過兩行，合并成一行呢？也不是不可以：

??? std::string ext = filename.substr(filename.rfind('.') == std::string::npos ? filename.length() : filename.rfind('.') + 1);

我知道，rfind 執行了兩次。不過第一，你可以希望編譯器把它優化掉，其次，擴展名一般都很短，即便多執行一次，區別應該是相當微小。

STL 算法

distance
很多時候我們希望在一個 vector ，或者 list ，或者什么其他東西里面，找到一個值在哪個位置，這個時候 find 幫不上忙，而有人就轉而求助手寫循環了，而且是原始的手寫循環：

for ( int i = 0; i < vect.size(); ++i)
??? if ( vect[i] == value ) break;

如果編譯器把 i 看作 for scope 的一部分，你還要把 i 的聲明拿出去。真的需要這樣么？看看這個：

??? int dist =
??????? distance(col.begin(),
??????????? find(col.begin(), col.end(), 5));

其中 col 可以是很多容器，list, vector, deque... 當然這是你確定 5 就在 col 里面的情形，如果你不確定，那就加點判斷：

??? int dist;
??? list<int>::iterator pos = find(col.begin(), col.end(), 5);
??? if ( pos != col.end() )
??????? dist = distance(col.begin(), pos);

我想這還是比手寫循環來的好些吧。

--------------------------------------------------------------------------
max, min
這是有直接的算法支持的，當然復雜度是 O(n)，用于未排序容器，如果是排序容器...老兄，那還需要什么算法么？

max_element(col.begin(), col.end());
min_element(col.begin(), col.end());

注意返回的是 iterator ，如果你關心的只是值，那么好：

*max_element(col.begin(), col.end());
*min_element(col.begin(), col.end());

max_element 和 min_element 都默認用 less 來排序，它們也都接受一個 binary predicate ，如果你足夠無聊，甚至可以把 max_element 當成 min_element 來用，或者反之：

*max_element(col.begin(), col.end(), greater<int>()); // 返回最小值！
*min_element(col.begin(), col.end(), greater<int>()); // 返回最大值

當然它們的本意不是這個，而是讓你能在比較特殊的情況下使用它們，例如，你要比較的是每個元素的某個成員，或者成員函數的返回值。例如：

#include <iostream>
#include <list>
#include <algorithm>
#include <string>
#include <boost/bind.hpp>

using namespace boost;
using namespace std;

struct Person
{
??? Person(const string& _name, int _age)
??????? : name(_name), age(_age)
??? {}
??? int age;
??? string name;
};

int main()
{
??? list<Person> col;
??? list<Person>::iterator pos;

??? col.push_back(Person("Tom", 10));
??? col.push_back(Person("Jerry", 12));
??? col.push_back(Person("Mickey", 9));

??? Person eldest =
??????? *max_element(col.begin(), col.end(),
??????????? bind(&Person::age, _1) < bind(&Person::age, _2));//>=1.33
???
??? cout << eldest.name;
}

輸出是 Jerry ，這里用了 boost.bind ，原諒我不知道用 bind2nd, mem_fun 怎么寫，我也不想知道...

-------------------------------------------------------------------------
copy_if
沒錯，STL 里面壓根沒有 copy_if ，這就是為什么我們需要這個：

template<typename InputIterator, typename OutputIterator, typename Predicate>
OutputIterator copy_if(
??? InputIterator begin, InputIterator end, OutputIterator destBegin, Predicate p)
{
??? while (begin != end)
??? {
??????? if (p(*begin))*destBegin++ = *begin;
??????? ++begin;
??? }
??? return destBegin;
}

把它放在自己的工具箱里，是一個明智的選擇。

------------------------------------------------------------------------
慣用手法：erase(iter++)
如果你要去除一個 list 中的某些元素，那可千萬小心：（下面的代碼是錯的！！！）

#include <iostream>
#include <algorithm>
#include <iterator>
#include <list>

int main()
{
??? int arr[] = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10};
??? std::list<int> lst(arr, arr + 10);

??? for ( std::list<int>::iterator iter = lst.begin();
????????? iter != lst.end(); ++iter)
??????? if ( *iter % 2 == 0 )
??????????? lst.erase(iter);
???????????
??? std::copy(lst.begin(), lst.end(),
??????? std::ostream_iterator<int>(std::cout, " "));
}

當 iter 被 erase 掉的時候，它已經失效，而后面卻還會做 ++iter ，其行為無可預期！如果你不想動用 remove_if ，那么唯一的選擇就是：

#include <iostream>
#include <algorithm>
#include <iterator>
#include <list>

int main()
{
??? int arr[] = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10};
??? std::list<int> lst(arr, arr + 10);

??? for ( std::list<int>::iterator iter = lst.begin();
????????? iter != lst.end(); )
??????? if ( *iter % 2 == 0 )
??????????? lst.erase(iter++);
??????? else
??????????? ++iter;
??????????
??? std::copy(lst.begin(), lst.end(),
??????? std::ostream_iterator<int>(std::cout, " "));
}

但是上面的代碼不能用于 vector, string 和 deque ，因為對于這些容器， erase 不光令 iter 失效，還令 iter 之后的所有 iterator 失效！

-------------------------------------------------------------------------
erase(remove...) 慣用手法
上面的循環如此難寫，如此不通用，如此不容易理解，還是用 STL 算法來的好，但是注意，光 remove_if 是沒用的，必須使用 erase(remove...) 慣用手法：

#include <iostream>
#include <algorithm>
#include <iterator>
#include <list>
#include <functional>
#include <boost/bind.hpp>

int main()
{
??? int arr[] = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10};
??? std::list<int> lst(arr, arr + 10);

??? lst.erase(remove_if(lst.begin(), lst.end(),
??????? boost::bind(std::modulus<int>(), _1, 2) == 0),
??????? lst.end()
??? );
??????????
??? std::copy(lst.begin(), lst.end(),
??????? std::ostream_iterator<int>(std::cout, " "));
}

當然，這里借助了 boost.bind ，讓我們不用多寫一個沒用的 functor 。

簡單常識——關于stream
從文件中讀入一行

簡單，這樣就行了：

ifstream ifs("input.txt");
char buf[1000];

ifs.getline(buf, sizeof buf);

string input(buf);

當然，這樣沒有錯，但是包含不必要的繁瑣和拷貝，況且，如果一行超過1000個字符，就必須用一個循環和更麻煩的緩沖管理。下面這樣豈不是更簡單？

string input;
input.reserve(1000);
ifstream ifs("input.txt");
getline(ifs, input);

不僅簡單，而且安全，因為全局函數 getline 會幫你處理緩沖區用完之類的麻煩，如果你不希望空間分配發生的太頻繁，只需要多 reserve 一點空間。

這就是“簡單常識”的含義，很多東西已經在那里，只是我一直沒去用。

---------------------------------------------------------------------------

一次把整個文件讀入一個 string

我希望你的答案不要是這樣：

string input;
while( !ifs.eof() )
{
??? string line;
??? getline(ifs, line);
??? input.append(line).append(1, '\n');
}

當然了，沒有錯，它能工作，但是下面的辦法是不是更加符合 C++ 的精神呢？

string input(
??? istreambuf_iterator<char>(instream.rdbuf()),
??? istreambuf_iterator<char>()
);

同樣，事先分配空間對于性能可能有潛在的好處：

string input;
input.reserve(10000);
input.assign(
??? istreambuf_iterator<char>(ifs.rdbuf()),
??? istreambuf_iterator<char>()
);

很簡單，不是么？但是這些卻是我們經常忽略的事實。
補充一下，這樣干是有問題的：

??? string input;
??? input.assign(
??????? istream_iterator<char>(ifs),
??????? istream_iterator<char>()
??? );

因為它會忽略所有的分隔符，你會得到一個純“字符”的字符串。最后，如果你只是想把一個文件的內容讀到另一個流，那沒有比這更快的了：

??? fstream fs("temp.txt");
??? cout << fs.rdbuf();

因此，如果你要手工 copy 文件，這是最好的（如果不用操作系統的 API）：

?? ifstream ifs("in.txt");
?? ofstream ofs("out.txt");
?? ofs << in.rdbuf();

-------------------------------------------------------------------------

open 一個文件的那些選項

ios::in????Open file for reading
ios::out??? Open file for writing
ios::ate??? Initial position: end of file
ios::app??? Every output is appended at the end of file
ios::trunc? If the file already existed it is erased
ios::binary Binary mode

-------------------------------------------------------------------------

還有 ios 的那些 flag

flag effect if set

ios_base::boolalpha input/output bool objects as alphabetic names (true, false).

ios_base::dec input/output integer in decimal base format.

ios_base::fixed output floating point values in fixed-point notation.

ios_base::hex input/output integer in hexadecimal base format.

ios_base::internal the output is filled at an internal point enlarging the output up to the field width.

ios_base::left the output is filled at the end enlarging the output up to the field width.

ios_base::oct input/output integer in octal base format.

ios_base::right the output is filled at the beginning enlarging the output up to the field width.

ios_base::scientific output floating-point values in scientific notation.

ios_base::showbase output integer values preceded by the numeric base.

ios_base::showpoint output floating-point values including always the decimal point.

ios_base::showpos output non-negative numeric preceded by a plus sign (+).

ios_base::skipws skip leading whitespaces on certain input operations.

ios_base::unitbuf flush output after each inserting operation.

ios_base::uppercase output uppercase letters replacing certain lowercase letters.

There are also defined three other constants that can be used as masks:

constant value

ios_base::adjustfield left | right | internal

ios_base::basefield dec | oct | hex

ios_base::floatfield scientific | fixed

--------------------------------------------------------------------------

用我想要的分隔符來解析一個字符串，以及從流中讀取數據

這曾經是一個需要不少麻煩的話題，由于其常用而顯得尤其麻煩，但是其實 getline 可以做得不錯：

??? getline(cin, s, ';');???
??? while ( s != "quit" )
??? {
??????? cout << s << endl;
??????? getline(cin, s, ';');
??? }

簡單吧？不過注意，由于這個時候 getline 只把 ; 作為分隔符，所以你需要用 ;quit; 來結束輸入，否則 getline 會把前后的空格和回車都讀入 s ，當然，這個問題可以在代碼里面解決。

同樣，對于簡單的字符串解析，我們是不大需要動用什么 Tokenizer 之類的東西了：

#include <iostream>
#include <sstream>
#include <string>

using namespace std;

int main()
{
??? string s("hello,world, this is a sentence; and a word, end.");
??? stringstream ss(s);
???
??? for ( ; ; )
??? {
??????? string token;
??????? getline(ss, token, ',');
??????? if ( ss.fail() ) break;
???????
??????? cout << token << endl;
??? }
}

輸出：

hello
world
?this is a sentence; and a word
?end.

很漂亮不是么？不過這么干的缺陷在于，只有一個字符可以作為分隔符。

--------------------------------------------------------------------------

把原本輸出到屏幕的東西輸出到文件，不用到處去把 cout 改成 fs

#include <iostream>
#include <fstream>

using namespace std;

int main()
{????
??? ofstream outf("out.txt");?
??? streambuf *strm_buf=cout.rdbuf();????
??? cout.rdbuf(outf.rdbuf());?
??? cout<<"write something to file"<<endl;?
??? cout.rdbuf(strm_buf);?? //recover?
??? cout<<"display something on screen"<<endl;
??? system("PAUSE");
??? return 0;
}

輸出到屏幕的是：

display something on screen

輸出到文件的是：

write something to file

也就是說，只要改變 ostream 的 rdbuf ，就可以重定向了，但是這招對 fstream 和 stringstream 都沒用。

--------------------------------------------------------------------------

關于 istream_iterator 和 ostream_iterator

經典的 ostream_iterator 例子，就是用 copy 來輸出：

#include <iostream>
#include <fstream>
#include <sstream>
#include <algorithm>
#include <vector>
#include <iterator>

using namespace std;

int main()
{??
??? vector<int> vect;
??? for ( int i = 1; i <= 9; ++i )
??????? vect.push_back(i);
???????
??? copy(vect.begin(), vect.end(),
??????? ostream_iterator<int>(cout, " ")
??? );
??? cout << endl;
???
??? ostream_iterator<double> os_iter(cout, " ~ ");
??? *os_iter = 1.0;
??? os_iter++;
??? *os_iter = 2.0;
??? *os_iter = 3.0;
}

輸出：

1 2 3 4 5 6 7 8 9
1 ~ 2 ~ 3 ~

很明顯，ostream_iterator 的作用就是允許對 stream 做 iterator 的操作，從而讓算法可以施加于 stream 之上，這也是 STL 的精華。與前面的“讀取文件”相結合，我們得到了顯示一個文件最方便的辦法：

??? copy(istreambuf_iterator<char>(ifs.rdbuf()),
???????? istreambuf_iterator<char>(),
???????? ostreambuf_iterator<char>(cout)
??? );

同樣，如果你用下面的語句，得到的會是沒有分隔符的輸出：

??? copy(istream_iterator<char>(ifs),
???????? istream_iterator<char>(),
???????? ostream_iterator<char>(cout)
??? );

那多半不是你要的結果。如果你硬是想用 istream_iterator 而不是 istreambuf_iterator 呢？還是有辦法：

??? copy(istream_iterator<char>(ifs >> noskipws),
???????? istream_iterator<char>(),
???????? ostream_iterator<char>(cout)
??? );

但是這樣不是推薦方法，它的效率比第一種低不少。
如果一個文件 temp.txt 的內容是下面這樣，那么我的這個從文件中把數據讀入 vector 的方法應該會讓你印象深刻。

12345 234 567
89??? 10

程序：

#include <iostream>
#include <fstream>
#include <algorithm>
#include <vector>
#include <iterator>

using namespace std;

int main()
{??
??? ifstream ifs("temp.txt");
???
??? vector<int> vect;
??? vect.assign(istream_iterator<int>(ifs),
??????? istream_iterator<int>()
??? );

??? copy(vect.begin(), vect.end(), ostream_iterator<int>(cout, " "));
}

輸出：

12345 234 567 89 10

很酷不是么？判斷文件結束、移動文件指針之類的苦工都有 istream_iterator 代勞了。

-----------------------------------------------------------------------

其它算法配合 iterator

計算文件行數：

??? int line_count =
??????? count(istreambuf_iterator<char>(ifs.rdbuf()),
????????????? istreambuf_iterator<char>(),
????????????? '\n');???????

當然確切地說，這是在計算文件中回車符的數量，同理，你也可以計算文件中任何字符的數量，或者某個 token 的數量：

??? int token_count =
??????? count(istream_iterator<string>(ifs),
????????????? istream_iterator<string>(),
????????????? "#include");???????

注意上面計算的是 “#include” 作為一個 token 的數量，如果它和其他的字符連起來，是不算數的。

------------------------------------------------------------------------
Manipulator

Manipulator 是什么？簡單的說，就是一個接受一個 stream 作為參數，并且返回一個 stream 的函數，比如上面的 unskipws ，它的定義是這樣的：

? inline ios_base&
? noskipws(ios_base& __base)
? {
??? __base.unsetf(ios_base::skipws);
??? return __base;
? }

這里它用了更通用的 ios_base 。知道了這一點，你大概不會對自己寫一個 manipulator 有什么恐懼感了，下面這個無聊的 manipulator 會忽略 stream 遇到第一個分號之前所有的輸入（包括那個分號）：

template <class charT, class traits>
inline std::basic_istream<charT, traits>&
ignoreToSemicolon (std::basic_istream<charT, traits>& s)
{
??? s.ignore(std::numeric_limits<int>::max(), s.widen(';'));
??? return s;
}

不過注意，它不會忽略以后的分號，因為 ignore 只執行了一次。更通用一點，manipulator 也可以接受參數的，下面這個就是 ignoreToSemicolon 的通用版本，它接受一個參數， stream 會忽略遇到第一個該參數之前的所有輸入，寫起來稍微麻煩一點：

struct IgnoreTo {
??? char ignoreTo;
??? IgnoreTo(char c) : ignoreTo(c)
??? {}
};
???
std::istream& operator >> (std::istream& s, const IgnoreTo& manip)
{
??? s.ignore(std::numeric_limits<int>::max(), s.widen(manip.ignoreTo));
??? return s;
}

但是用法差不多：

??? copy(istream_iterator<char>(ifs >> noskipws >> IgnoreTo(';')),
???????? istream_iterator<char>(),
???????? ostream_iterator<char>(cout)
??? );

其效果跟 IgnoreToSemicolon 一樣。

新用戶注冊刷新評論列表


只有注冊用戶登錄后才能發表評論。




網站導航: 博客園 IT新聞 Chat2DB C++博客博問管理
相關文章: c,c++水平相關如何閱讀別人代碼 [轉貼] 【轉貼】泛型算法：Tips 編譯boost庫-用vc2005編譯boost boost.asio 預處理器高手戲玩c++

Free mind

高手戲玩c++

常用鏈接

留言簿(1)

隨筆分類(36)

隨筆檔案(39)

blog

C/Cpp

Cluster

japanese

java

P2p

pattern

WTL

搜索

最新評論

評論排行榜